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Remarks 

Reconsideration of this Application is respectfully requested. 

Upon entry of the foregoing amendment, claims 1-6, 11-12, and 45-77 are 
pending in the application, with claims 1-4, 45, 54, 64 and 74 being the independent 
claims. Claims 1 1 and 13-44 are sought to be cancelled without prejudice to or 
disclaimer of the subject matter therein. New claims 45-77 are sought to be added. 
These changes are believed to introduce no new matter, and their entry is respectfully 
requested. 

New claims 45-73 correspond to cancelled claims 14-22, 24-33, and 35-44. For 
example, new claims 45, 54, and 64 are redrafted forms of cancelled claims 14, 24, and 
35, respectively, which have been rewritten to more clearly define Applicants' invention. 
Support for new claims 45-73 can be found throughout the specification as filed, inter 
alia, at page 26, line 27, to page 33, line 25; at page 34, lines 21-28; in Example 8, at 
pages 67-69; and in Figure 3D and Figure 13D. 

New claims 74-77 are drawn to a method for identifying a compound having the 
potential to inhibit cytokinesis by determining the compounds ability to inhibit CYK-4 
function by determining the compound^ ability to inhibit self association of MKLP1 
protein subfamily members. Support for new claims 74-77 can be found in the 
specification, inter alia, at page 32, lines 6-22; and at page 33, line 26, to page 34, line 
27. 

Applicants have also amended the specification at page 30, lines 19-22, to add a 
sequence identifier number after each the two listed examples of the MLKP1 family of 
proteins, CeM03D4.1b and HsMKLPl, both of which are currently identified by 
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GenBank accession numbers. The specification has been amended to insert SEQ ID 

NO:7 after the description of CeM03D4.1b, and SEQ ID NO:8 after the description of 

HsMKLPl. A substitute sequence listing, in the form of substitute Sequence Listing 

sheets and a substitute computer readable copy, is also submitted herewith containing the 

amino acid sequences for SEQ ID NO:7 and SEQ ID NO:8. 

Pursuant to 37 C.F.R. § 1 .825(a) and (b), Applicants assert that the substitute 
sequence listing sheets contain no new matter, and that the copy of the sequence listing 
in the computer readable form is the same as the substitute copy of the sequence listing 
submitted herewith. 

SEQ ID NO: 7 and SEQ ED NO: 8 represent the amino acid sequences of 
CeM03D4.1b and HsMKLPl, respectively, present as the cited GenBank accession 
numbers at the time of filing of the application. Accordingly, Applicants assert that the 
addition of these two amino acid sequences to the application's Sequence Listing, with 
insertion of sequence identifiers in the corresponding text of specification, does not 
constitute the addition of new matter because both sequences are inherent in the 
GenBank database accession numbers cited in the application as originally filed. Thus, 
support for SEQ ID NO: 7 and SEQ ID NO: 8 can be found inherently in the specification 
as originally filed. 

Support for the addition of SEQ ID NO:7 and SEQ ED NO:8 can also be found in 
the printouts of the sequences as they existed in the GenBank database at the time of 
filing of the current application (June 18, 2001) and at the time of filing of the earliest 
priority application (EP 00 1 12 880.0, filed June 19, 2000). Applicants thus submit 
herewith the GenBank database printouts of the SEQ ED NO:7 and SEQ ID NO:8 
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sequences as they existed at the time of filing of the application, as Exhibit 1 (GenBank 

Accession Number U61955, protein ID 1397342 for CeM03D4.1b; submitted June 26, 

1998) and Exhibit 2 (GenBank Accession Number X67155, SwissProt Q02241 for 

HsMKLPl; submitted January 19, 2000). Applicants also submit printouts from the 

GenBank database listing the revision histories for SEQ ID NO:7 (Exhibit 3 for 

GenBank Accession No. U61955) and SEQ ID NO:8 (Exhibit 4 for GenBank Accession 

No. X67155), indicating that the sequences submitted as Exhibits 1 and 2 are the 

sequences existing as the cited GenBank accession numbers at the time of filing of the 

application. Exhibits 1-4 are also cited and submitted in Applicants' First Supplemental 

Information Disclosure Statement submitted herewith. 

These changes are believed to introduce no new matter, and their entry is 
respectfully requested. 

Based on the above amendment and the following remarks, Applicants 
respectfully request that the Examiner reconsider all outstanding objections and 
rejections and that they be withdrawn. 

7. Rejection under 35 U.S.C. § 112, First Paragraph 

The Examiner rejects claim 26 under 35 U.S.C. § 1 12, first paragraph, as 
allegedly containing subject matter which was not described in the specification in such a 
way as to enable one of skill in the art to which it pertains, or with which it is most 
nearly connected, to make and/or use the invention. (Office Action, at paragraph IV, 
pages 2 and 3.) 
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Specifically, the Examiner alleges that the sequences of the MKLP1 subfamily 
members recited in claim 26 are essential to practice the claimed invention and that 
without the sequence information, one cannot practice the claimed invention (Office 
Action, at page 3, lines 6-7.) The Examiner further asserts that because the sequences 
cannot be incorporated by reference other than to issued US patents or US patent 
applications to be issued, and because, in the present case, the MKLP1 family member 
sequences recited in claim 26 are represented by GenBank database numbers, Applicant 
is required to amend the disclosure to include the material incorporated by reference 
(Office Action, at page 3, lines 8-13). The Examiner also states that the amendment 
must be accompanied by an affidavit or declaration executed by the applicant, or a 
practitioner representing the applicant, stating that the amendatory material consists of 
the same material incorporated by reference in the referencing application. (Office 
Action, at page 3, lines 13-16.) 

To expedite prosecution and without acquiescing in the propriety of the rejection, 
Applicants have canceled claim 26. Applicants, however, have added new claim 56, 
which corresponds to cancelled claim 26 and which recites the same GenBank database 
accession numbers as cancelled claim 26. Applicants assert that the sequences of 
CeM03D4.1b and HsMKLPl were known in the art and publicly available at the time of 
filing of the application through the GenBank database accession numbers disclosed in 
the specification. The GenBank database accession numbers are currently the same as at 
the time of filing of the application, and the sequences for these proteins are publicly 
available. Applicants therefore believe that the sequences for these proteins are not 
essential material and do not need to be incorporated into the application. 
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To expedite prosecution and without acquiescing in the propriety of the 
Examiner's rejection, however, Applicants have amended the current specification at 
page 30, lines 19-22, to add the sequence identifier number SEQ LD NO:7 after the 
description of CeM03D4.1b, and SEQ ID NO:8 after the description of HsMKLPl, the 
two examples of MLKP1 proteins listed in the specification, both of which are currently 
identified by GenBank accession numbers. Applicants also submit herewith a substitute 
sequence listing, in the form of substitute Sequence Listing sheets and a substitute 
computer readable copy, containing the amino acid sequences for SEQ ID NO: 7 and 
SEQ ED NO:8, which represent the amino acid sequences of CeM03D4.1b and 
HsMKLPl, respectively, present in the cited GenBank accession numbers at the time of 
filing of the application. 

Support for the addition of SEQ ID NOs. 7 and 8 can be found inherently in the 
application as originally filed, in the GenBank database accession numbers disclosed in 
the specification. As evidence that SEQ ID NO:7 and SEQ ID NO:8 represent the 
sequences for the CeM03D4.1b and HsMKLPl proteins as they existed in the GenBank 
database at the time of filing of the application, Applicants submit herewith the GenBank 
database printouts for the CeM03D4.1b and HsMKLPl protein sequences (SEQ ID 
NO:7 and SEQ ID NO:8, respectively) as they existed at the time of filing of the 
application, as Exhibit 1 (GenBank Accession Number U61955, protein ID 1397342 for 
CeM03D4.1b; submitted June 26, 1998) and Exhibit 2 (GenBank Accession Number 
X67155, SwissProt Q02241 for HsMKLPl; submitted January 19, 2000). Applicants 
also submit printouts from the GenBank database listing the revision histories for 
GenBank Accession No. U61955 (SEQ ID NO:7), as Exhibit 3, and GenBank Accession 
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No. X67155 (SEQ ID NO:8), as Exhibit 4, indicating that the protein sequences 

submitted in Exhibits 1 and 2 are the sequences existing at the time of filing of the 

application. Exhibits 1-4 are also cited and submitted in Applicants* First Supplemental 

Information Disclosure Statement submitted herewith application. 

Applicants believe that the rejection of claim 26 under 35 U.S.C.§1 12, first 

paragraph, has been overcome and respectfully request the Examiner to reconsider and 

withdraw this rejection. 

IL Rejections under 35 U.S.C. § 112, Second Paragraph 

The Examiner rejects claims 13-33 under 35 U.S.C. § 1 12, second paragraph, as 
allegedly being indefinite for failing to particularly point out and distinctly claim the 
subject matter which applicants regard as the invention. (Office Action, at paragraph V, 
pages 4-5.) 

Specifically, the Examiner asserts that claims 13-33 are indefinite because the 
steps of the methods recited in claims 14 and 24 do not correlate to the preamble of the 
claims, which is identifying a compound capable of modulating cytokinesis, and because 
the essential steps in claims 13 and 23 are missing. (Office Action, at page 4, lines 12-13 
and 16.) The Examiner also states that "it is unclear from the specification that there is a 
direct link between 'modulating cytokinesis* and 'modulating the ability of a CYK-4 GAP 
to promote GTP hydrolysis by a Rho family GTPase.'" (Office Action, at page 4, lines 
13-15.) 

To expedite prosecution and without acquiescing in the propriety of the rejection, 
Applicants have cancelled claims 13-33. Applicants have added new claims 45-63, 
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however, which are redrafted forms of cancelled claims 14-22 and 24-33. In particular, 
new claims 45 and 54 are redrafted forms of cancelled claims 14 and 24, respectively, 
which have been rewritten to more clearly define the claimed methods. For example, 
each of new claims 45 and 54, which correspond to cancelled claims 14 and 24, 
respectively, has also been redrafted to recite in its preamble a method for identifying a 
compound having the potential to inhibit cytokinesis (rather than a method for 
identifying a compound capable of modulating cytokinesis, as recited in cancelled claim 
14) either by determining the compound's ability to inhibit the function of CYK-4 protein 
or fragment thereof to promote GTP hydrolysis by a Rho family GTPase (claim 45), or 
by determining the compound's ability to interfere with the function of CYK-4 protein or 
fragment thereof to bind to members of the MKLP1 subfamily of kinesin-like proteins 
(claim 54). New claims 45 and 54 have also been redrafted to more clearly define the 
recited steps and to specifically correlate the steps to the preamble of the claim. 

With respect to the Examiner's statement that it is allegedly unclear from the 
specification that there is a " direct link between 'modulating cytokinesis' and 
'modulating the ability of a CYK-4 GAP to promote GTP hydrolysis by a Rho family 
GTPase'" (Office Action, at page 4, lines 13-15), Applicants direct the Examiner's 
attention to the specification, at page 9, line 19, to page 15, line 2, and, in particular, at 
page 9, line 21, to page 10, line 2. 

The Examiner also asserts that claims 13-15, 20-27, and 31-33 are allegedly 
indefinite because they recite "CYK-4" and "MLKPl," which the Examiner identifies as 
acronyms. (Office Action, at page 4, lines 18-19.) 
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To expedite prosecution and without acquiescing in the propriety of the rejection, 
Applicants have cancelled claims 13-15, 20-27, and 31-33. Applicants have added new 
claims 45-63, however, which are redrafted forms of the cancelled claims and which 
recite "CYK-4" or "MLKP1." 

Applicants submit that the terms "CYK-4" and "MLKP1" do not render the 
claims indefinite because one of ordinary skill in the art could interpret the metes and 
bounds of the claims reciting these terms. As described in the specification at page 5, 
lines 1 1-25, "CYK-4" refers to a protein represented by the CYK-4 protein encoded by 
the C elegans cyk-4 gene, which derives its name from the C. elegans cyk-4 locus. See 
Gonczy, P. et aL, J. Cell Biol. 144:927-946 (March 1999) and Jantsch-Plunger, V. et al., 
J. Cell Biol. 749:1391-1404 (June 2000) (previously submitted as document nos. AT9 
and AS 13 in the Information Disclosure Statement submitted by Applicants on January 
18, 2002). Sequences for the human and mouse CYK-4 proteins are provided in the 
specification as SEQ ID Nos. 2 and 4, respectively. The specification, in Example 2, at 
page 50, lines 1-22, also describes several domains of C. elegans CYK-4 that are shared 
by mouse, human, and Drosophila forms of CYK-4 and thus conserved among 
metazoans, indicating a general structure for CYK-4 protein: a conserved C-terminus 
containing a GTPase activating protein (GAP) domain; a CI domain predicted to bind to 
diacylglycerol or phorbol esters; and an amino terminal coiled-coil domain that mediates 
the interaction of CYK-4 with ZEN-4/MKLP 1 (see also page 14, line 18, to page 15, line 
11; and page 31, lines 2-5). Figure 3D and Figure 13D also present schematic diagrams 
showing the similar domain structure of the C elegans CYK-4 protein and its human 
analog, MgcRacGAP/human CYK-4 ("HsCYK-4"). Thus, one of skill in the art would 
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recognize the proteins represented by "CYK-4" and, consequently, could interpret the 

metes and bounds of a claim containing the term "CYK-4." 

Similarly, one of one of ordinary skill in the art would recognize the proteins 
represented by the term "MKLP1 subfamily of proteins" recited in the claims at issue. 
"MKLP1" represents human mitotic kinesin-like protein 1, a member of the kinesin 
family of motor proteins, which is known to those of skill in the art by the term 
"MKLP1." See Lee et al, Mol Cell Biol 75:7143-7151 (1995), at page 7144, upper left 
column (previously cited and submitted as document AR16 in Applicants' Information 
Disclosure Statement, filed on January 18, 2002). Analysis of the conserved kinesis 
motor domain has been used to construct subfamilies of kinesin proteins, one of which is 
the MKLP1 subfamily, the members of which are known to those of skill in the art. See, 
for example, Raich, W.B., et al,Mol Biol Cell P:2037-2049 (1998); Powers, J., et al, 
Curr. Biol 8:1 133-1 136 (1998); and Adams, R.R. et al, Genes & Development 
72148301494 (1998) (previously cited and submitted as document nos. AS21, AT20, and 
AR1 Applicants' January 18, 2002, Information Disclosure Statement). Thus, one of 
ordinary skill in the art would recognize the protein represented by "MKLP1 subfamily 
of proteins" and, as a consequence, could interpret the metes and bounds of a claim 
containing the term "MKLP1." 

Thus, use of the terms "CYK-4" and/or "MKLP1" in the claims at issue therefore 
does not render the claims indefinite. 

The Examiner further asserts that claims 23-33 are allegedly indefinite because 
each claim recites the term "interact" or "interaction," and that there is allegedly no 
unambiguous definition of the term such that one skilled in the art would know the metes 
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and bounds of the term. (Office Action, at page 5, lines 8-12.) The Examiner states that 
"[i]f the term interact' is intended to mean 'binding 1 , the claims should be amended so." 
(Office Action, at page 5, lines 12-13.) 

To expedite prosecution and without acquiescing in the propriety of the rejection, 
Applicants have cancelled claims 23-33. Applicants have added new claims 55-63, 
however, which are redrafted forms of the cancelled claims and which substitute the term 
"bind" or "binding" for the terms "interact" or "interaction," respectively. 

The Examiner also asserts that claims 13, 23, and 25 are allegedly indefinite 
because claim 13 recites "a GAP domain" and claims 23 and 25 recite "a domain of said 
CYK-4 GTPase activating protein." The Examiner states that "[s]ince neither the art nor 
the specification defines the terms unambiguously, the claims are indefinite." (Office 
Action, at page 5, lines 14-17.) 

To expedite prosecution and without acquiescing in the propriety of the rejection, 
Applicants have cancelled claims 13, 23, and 25. However, Applicants have added new 
claims 45, 54, and 55, which are redrafted forms of the cancelled claims 13, 23, and 25, 
respectively. Cancelled claim 13 recited "a GAP domain," while new claim 45 recites in 
step (ii) "wherein the CYK-4 protein or fragment thereof comprises a GTPase activating 
protein domain." The GTPase activating protein domain, which can also be referred to 
as a "GAP" domain (see and page 10, lines 7-10, of the specification at), is described in 
the specification at page 3, line 21, and at page 10, lines 7-10, and is illustrated in 
Figures 3D and 1 3D, contrary to the Examiner's assertion. 

New claim 54 recites in (i) the phrase "comprises a domain that binds MKLP1 
subfamily proteins," which is a redrafted form of the phrase "a domain of said CYK-4 
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GTPase activating protein that interacts with said member of the MKLP1 subfamily of 

proteins 11 in cancelled claim 23. Again, contrary to the Examiner's assertion, the 

specification describes the domain of CYK-4 the binds MKLP1 subfamily proteins. See, 

for example, the specification at page 31, lines 13-15; at page 32, lines 12-15; Figure 3D; 

and Example 8, at pages 67-69. 

Lastly, new claim 55 recites "that comprises a domain that binds the CYK-4 
protein," which is a redrafted form of the phrase "that comprises a domain that interacts 
with said CYK-4 GTPase activating protein" in cancelled claim 25. The latter phrase is 
not the same as the phrase "a domain of said CYK-4 GTPase activating protein that 
interacts with said member of the MKLP1 subfamily of proteins" referred to by the 
Examiner in cancelled claim 23. The phrase "that comprises a domain that binds the 
CYK-4 protein" in new claim 55 refers to the protein domain in MKLP1 that binds to 
CYK-4, which is described in the specification at page 6, lines 3-24; at page 15, lines 3- 
28; in Example 9, at pages 69-70 (MKLP1 is an ortholog of ZEN-4); and in Figure 14C. 

The Examiner also asserts that claim 30 is allegedly indefinite because it recites 
"the N-terminal region containing amino acids 1-120" and, according to the Examiner, it 
is allegedly "unclear what is referred by the term." (Office Action, at page 5, lines 18- 
19.) 

To expedite prosecution and without acquiescing in the propriety of the rejection, 
Applicants have cancelled claim 30. However, Applicants have added new claim 60, 
which is a redrafted form of the cancelled claim 30. New claim 60 recites "wherein the 
fragment of the CYK-4 protein comprises amino acid residues 1-120." Support for new 
claim 60 can be found in the specification in Example 8, pages 67-69, and in Figure 3D. 
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Applicants believe that the rejection of claims 13-33 under 35 U.S.C.§1 12, 

second paragraph, has been overcome or rendered moot and respectfully request the 

Examiner to reconsider and withdraw the rejections. 

IIL Claim Objections 

The Examiner objects to claims 13-16, 18, 20-26, 28, and 30-33 because they 
allegedly recite non-elected subject matter. (Office Action, at paragraph VI, pages 5-6.) 

Specifically, the Examiner states that that "[w]hile claims that recite more than 
one elected species are permitted in an application during prosecution, those claims will 
remain objected to during prosecution." (Office Action, at page 6, line 5-7.) The 
Examiner states further that he "clarifies that the restriction to a single CYK-4 
polypeptide in Paper No. 10 (April 17, 2003) is not a species election; rather it sets forth 
additional invention groups. In the instant case, Applicants elected human CYK-4 
polypeptide." (Office Action, at page 6, line 7-10.) 

Applicants request clarification regarding the Examiner's restriction to HsCYK-4 
or murine CYK-4 and respectfully ask the Examiner to confirm that he is examining all 
generic linking claims reciting both human and murine CYK-4 as required under Section 
809.03 of the Manual of Patent Examining Procedure (MPEP § 809.03 (May 2004)). 
Because, with the current amendment, claims 13-16, 18, 20-26, and 30-33 are cancelled, 
the generic claims at issue are new claims 45-63, i.e., those claims elected for 
prosecution that recite CYK-4 generically and thus encompass both murine and human 
forms of the CYK-4 protein. 
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Although Examiner states in the pending Office Action that the claims have been 
restricted to either the human or murine form of CYK-4 in Paper No. 10 (April 17, 2003) 
and that this restriction was not an election of species, Applicants note that the Examiner 
has objected to the claims as containing non-elected matter, as if the restriction were an 
election of species. The Examiner has also not indicated on the face page of the pending 
Office Action that the claims are withdrawn. 

According to MPEP § 809.03, if an application contains genus claims that link 
species ("linking claims") and the Examiner restricts the claims to individual species, the 
linking claims must be examined because the restriction requirement is subject to non- 
allowance of the linking claims. Section 809.03 also instructs the Examiner to specify in 
the restriction requirement, which claims are considered to be linking by inserting Form 
Paragraph 8.12 when making restrictions that involve linking claims. Applicants note 
that in Paper No. 10, the Examiner did not mention that the claims being restricted to the 
human or murine form of the CYK-4 polypeptide (currently cancelled claims 7-10 and 
withdrawn claim 11) were linking claims and that the restriction requirement was subject 
to non-allowance of the linking claims. 

Thus, Applicants respectfully request that the Examiner confirm that he is 
examining all generic linking claims elected for prosecution that recite CYK-4 
generically and thus encompass both murine and human forms of the CYK-4 protein 
(currently new claims 45-63). 
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Conclusion 



All of the stated grounds of objection and rejection have been properly traversed, 
accommodated, or rendered moot. Applicants therefore respectfully request that the 
Examiner reconsider all presently outstanding objections and rejections and that they be 
withdrawn. Applicants believe that a full and complete reply has been made to the 
outstanding Office Action and, as such, the present application is in condition for 
allowance. If the Examiner believes, for any reason, that personal communication will 
expedite prosecution of this application, the Examiner is invited to telephone the 
undersigned at the number provided. 

Prompt and favorable consideration of this Amendment and Reply is respectfully 
requested. 



Respectfully submitted, 



Sterne, Kessler, Goldstein & Fox p.l.l.c. 




Ann E. Summerfield 
Attorney for Applicants 
Registration No. 47,982 




1 100 New York Avenue, N.W. 
Washington, D.C. 20005-3934 
(202) 371-2600 



332604v7 



jNCbi Sequence viewer 



Page 1 of 15 



Entrez PubMed 

Search i Nucleotide 



default 



CGC7CAGGAT 

rrtTCX- T AT 

Nucleotide 




ATCGSATCCCCG 
fATATACACACAt 
CTTCGCATACGTC 
£ IT AC TAAC € AAf ? 



Genome 



Limits 



Preview/Index 



Show: 



20 



SSendltGtl 



Hi: U61955 . Reports Caenorhabditis el...[gi:3258581] 



Structure 



History 



File 



ATT A TATA TCGATCGATC 1 
PMC Taxonomy 



Boo 



Go 



Gle§F 



Clipboard 



. De tails 



Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
. AUTHORS 
TITLE 
JOURNAL 

COMMENT 



CELM03D4 38750 bp DNA linear INV 26-JUN-1998 

Caenorhabditis elegans cosmid M03D4 . 

U61955 

U61955.1 GI:3258581 

Caenorhabditis elegans 
^§.n9.^h§.bdit_is e.legans 

Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; 
Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis . 

1 (bases 1 to 38750) 

Wilson, R., Ainscough, R. , Anderson, K., Baynes,C, Berks, M-., 
Bonfie-ld, J. , Burton, J., Connell,M., Copsey,T., Cooper, J., 
Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A., 
Fulton, L., Gardner, A., Green, P., Hawkins, T., Hillier,L., Jier,M., 
Johnston, L., Jones, M., Kershaw, J., Kirsten,J., Laister,N., 
Latreille, P. , Lightning, J . , Lloyd, C, McMurray,A., Mortimore, B . , 
O f Callaghan,M. , Parsons, J., Percy, C, Rifken,L., Roopra,A., 
Saunders, D. , Shownkeen, R . , Smaldon,N., Smith, A., Sonnhammer , E . , 
Staden,R., Sulston,J., Thierry-Mieg, J . , Thomas, K. , Vaudin,M., 
Vaughan,K., Waterston, R. , Watson, A., Weinstook, L . , 
Wilkinson-Sproat , J. and Wohldman,P. 

2.2 Mb of contiguous nucleotide sequence from chromosome III of C. 
elegans 

Nature 368 (6466), 32-38 (1994) 
94150718 

2 (bases 1 to 38750) 
Martin, J. and Murray, J.. 

The sequence of C. elegans cosmid M03D4 
Unpublished (1998) 

3 (bases 1 to 38750) 
Waterston, R. 

Direct Submission 
Submitted ( 23- JUN-1996 ) 

4 (bases 1 to 38750) 
Waterston, R. 

Direct Submission 

Submitted (26-JUN-1998 ) Department of Genetics, Washington 
University, 4444 Forest Park Avenue, St. Louis, Missouri 63108, USA 
On Jun 26, 1998 this sequence version replaced gi:l_397 33_9. 
Submitted by: 

Genome Sequencing Center 

Department of Genetics, Washington University, 
St. Louis, MO 63110, USA, and 
Sanger Centre, Hinxton Hall 
Cambridge CB10 IRQ, England 

e-mail: rw@nematode.wustl.edu and jes@sanger.ac.uk 



NOTICE: This sequence may not be the entire insert of this clone. 
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It may be shorter because we only sequence overlapping sections 
once, or longer because we provide a small overlap between 
neighboring submissions. 

This sequence was finished as follows unless otherwise noted: 
all regions were double stranded or sequenced with an alternate 
chemistry; an attempt was made to resolve all sequencing problems, 
such as compressions and repeats; all regions were covered by 
sequence from more than one subclone 



NEIGHBORING COSMID INFORMATION: 



. The 5' cosmid is K08B4, 2300 bp overlap;3 f cosmid is H10D12, 1200 
bp overlap. Actual start of this cosmid is at base position 1 of 
CELM03D4; actual end is at 37750 of CELM03D4 

NOTES: 

Coding sequences below are predicted from computer analysis, using 
the program Genef inder ( P . Green and L. Hillier, ms in preparation). 
FEATURES Location /Qualifiers 

source 1 . .38750 

/organism="Caenorhabditis elegans " 

/mol_type="genomic DNA" 

/strain="Bristol N2" 

/db_xref="taxon: 6239" 

/chromosome="IV" 

/clone="M03D4" 
gene complement (2816 . .4479) 

/gene="M03D4 . 3" 
CD S complement (join (2816. . 2917 , 4 4 02 . . 4 47 9) ) 

/gene="M03D4 .3" 

/codon_start=l 

/evidence=not_experimental 

/protein_id=" AAC24405. 1 " 

/db_xref= ,T GI : 1397340" 

/trans la tion="MRFFQVLLLVVLISATVALPFFNSPTSQTYGGQWRQMYTDENLE 
VPQNPRYIMSNRMRG" 
gene 10331. .13543 

/gene="M03D4 . la" 

CDS join (10331. .10773,10820. . 11068, 11117 .. 11416, 114 71 .. 11720, 

1177 3. .11955, 12001. .12723, 12780. .128 66,134 51. .1354 3) 
/gene="M03D4 . la" 

/note="Similar to kinesin-like protein; coded for by C. 
elegans cDNA ykl84h5.3; coded for by C. elegans cDNA 
ykl84h5.5; coded for by C. elegans cDNA ykl3d7.3; coded 
for by C. elegans cDNA ykl3d7.5; coded for by C. elegans 
cDNA yk31el.5; coded for by C. elegans cDNA yk35d!0.3; 
coded for by C. elegans cDNA yk35dl0.5; coded for by C. 
elegans cDNA yk36e3.3; coded for by C. elegans cDNA 
yk36e3.5" 
/codon_start=l 
/protein_id="AAC24 406 . 1 " 
/ db_x r e f = " G 1 : 1397341" 

/translat ion="MSSRKRGITPSRDQVRRKKLSIEETDSIEVVCRLCPYTGSTPSL 
IAIDEGSIQTVLPPAQFRRENAPQVEKVFRFGRVFSENDGQATVFERTSVDLILNLLK 
GQNSLLFTYGVTGSGKTYTMTGKPTETGTGLLPRTLDVIFNSINNRVEKCIFYPSALN 
TFEIRATLDAHLKRHQMAADRLSTSREITDRYCEAIKLSGYNDDMVCSVFVTYVEIYN 
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NYCYDLLEDARNGVLTKREIRHDRQQQMYVDGAKDVEVSSSEEALEVFCLGEERRRVS 
STLLNKDSSRSHSVFTIKLVMAPRAYETKSVYPTMDSSQIIVSQLCLVDLAGSERAKR 
TQNVGERLAEANSINQSLMTLRQCIEVLRRNQKSSSQNLEQVPYRQSKLTHLFKNYLE 
GNGKIRMVICVNPKPDDYDENMSALAFAEESQTIEVKKQVERMPSERIPHSFFTQWNS 
ELDGSVRMEDDGSREIPCPPTFCLTDCNDKDTVDSMYKYARKLSSLQNSSEEGPSSTL 
LTMIRQYMMEADYQRVEIARLKDSLNDKDEEIKKLRGFCSRYKRENASMKERIASCEQ 
GEQENALVMEKLMEQKMEDRKI IQSQKKAMRNVRGIIDNPSPSVASLRSRFDQENVAH 
PTAPIQTPPPPYQTPGRAPVFKKRLEATTSTTVMSGSSSGGSGQQGYVNPKYQRRSKS 
ASRLLDHQPLHRVPTGTVLQSRTPANAIRTTKPEMHQLNKSGEYRLTHQEVDDEGNIS 
TNIVKGNVIPTVSGGTAVFFNDIERLTHESPSTRK" 
gene 10331.. 12950 

/gene="M03D4 . lb" 

CDS join (10331. .1077 3,10820. . 11068 , 11 1 17 . . 114 1 6, 1 1471 . .11720, 

11773. . 11955, 12001. . 12723, 12780. . 12 950) 
/gene="M03D4 . lb" 

/note="Similar to kinesin-like protein; coded for by C. 
elegans cDNA ykl84h5.3; coded for by C. elegans cDNA 
ykl84h5.5; coded for by C. elegans cDNA yk2g4.3; coded for 
by C. elegans cDNA yk2g4.5; coded for by C. elegans cDNA 
yk31el.5; coded for by C. elegans cDNA yk35dl0.3; coded 
for by C. elegans cDNA yk35dl0.5; coded for by C. elegans 
cDNA yk36e3.3; coded for by C. elegans cDNA yk91c6.3; 
coded for by C. elegans cDNA yk91c6.5" 
/codon_start=l 
/protein_id="AAC24_4p7 . 1" 
/ db_xr e f = " G 1 : 139 7 3 4 2" _ 

/ translation="MSSRKRGITPSRDQVRRKKLSIEETDSIEVVCRLCPYTGSTPSL 
IAIDEGSIQTVLPPAQFRRENAPQVEKVFRFGRVFSENDGQATVFERTSVDLILNLLK 
GQNSLLFT YGVTGSGKTYTMTGKPTETGTGLLPRTLDVI FNS INNRVEKCI FYPSALN 
TFEIRATLDAHLKRHQMAADRLSTSREITDRYCEAIKLSGYN.DDMVCSVFVTYVEI YN 
NYCYDLLEDARNGVLTKREIRHDRQQQMYVDGAKDVEVSSSEEALEVFCLGEERRRVS 
STLLNKDSSRSHSVFTIKLVMAPRAYETKSVYPTMDSSQI IVSQLCLVDLAGSERAKR 
TQNVGERLAEANSINQSLMTLRQCIEVLRRNQKSSSQNLEQVPYRQSKLTHLFKNYLE 
GNGKIRMVICVNPKPDDYDENMSALAFAEESQTIEVKKQVERMPSERIPHSFFTQWNS 
ELDGSVRMEDDGSREIPCPPTFCLTDCNDKDTVDSMYKYARKLSSLQNSSEEGPSSTL 
LTMIRQYMMEADYQRVEIARLKDSLNDKDEEIKKLRGFCSRYKRENASMKERIASCEQ 
GEQENALVMEKLMEQKMEDRKI IQSQKKAMRNVRG I I DNPS PS VASLRSRFDQENVAH 
PTAPIQTPPPPYQTPGRAPVFKKRLEATTSTTVMSGSSSGGSGQQGYVNPKYQRRSKS 
ASRLLDHQPLHRVPTGTVLQSRTPANAIRTTKPEMHQLNKSGEYRLTHQEVDDEGNIS 
TNIVKVNSLVSTQKHACTVPLSFSRVLITHLS " 
gene complement (14 94 5 . .19402) 

/gene="M03D4 .4" 

CDS complement (join (14945. .15429,15476. .15753,158 66. .15987, 

1607 6. . 16162, 16932. . 1704 5 , 17096 . . 17305 , 1 9262 . . 1 £402 ) ) 
/gene="M03D4 .4" 

/note="contains multiple region of strong similarity to 

C2H2-type zinc fingers ( PS : PS00028 ) ; coded for by C. 

elegans cDNA CEESR82F" 

/codon_start=l 

/protein_id="AAC24408 .1" 

/db_xref ="GI : 13^"345"'" 

/trans la tion="MSSLYLCRDCSGAFHSLDELQRHEREEHETVEQGDQEEDRMEDD 
SDELAMIKIKIEDSDFLSDTDSSQLSMNPTTPSEKSSSGEKGRYECEDCHEMFAVKRE 
LATHMRIHSGEQPHSCTQCGKEFGTRQLLKKHWMWHTGERSHVCPHCNKAFFQKGHLT 
QHLMIHSGGRPHECPQCHKTFIFKFDLNRHMKIHQERGFSCQQCGRSFLKQVMLDEHH 
LKCKGKPSSPIRSLLTPTMKAGLESAISIKPPQESMILSSETIAKMAQKLLIQQQENH 
RNALNTLLVKQHENILNNNNNNESNILNGSVMHKDAGFEIPAPTIPLSLTCMICKSQF 
. . . NSQPSFTLHMYMHHIANQNPNLSIDSTHIHHTHQPTTISHQNDPTPLGSDSDLATDTS 
CASSPQKTSPLQLLESSCLEQSSVSPSSSSGASPQPTASESSTSSCKDCTNSWQRVHD 
LEQQMVKKDEEFENYKQMTKQVGFCKVF" 
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gene 
CDS 



ORIGIN . 

1 
61 
121 
181 
241 
301 
361 
421 
' 481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 



complement (35290. .38732) 
/gene="M03D4 . 6" 

complement (join (35290. .35386, 354 88. .35777,36123. .36377, 
364 27. .36516, 36606. .36711, 36957. .37201,37785. .38281, 
38609. .38732) ) 
/gene="M03D4 . 6" 

/note="coded for by C. elegans cDNA yk224fl.5; coded for 

by C. elegans cDNA yk224fl.3" 

/codon_start=l 

/protein__id="AAC2 4 409 . 1 " 

/db_xref ="GI : 3258 582 " 

/translation="MCSEKKTAHMLLSNCPVQGEGQQNTTGQFFVTTDSQFQFIKYTF 
KEIEKKLKNAIQNCLPNSKQIPSGLNFNYQMKFSVKSSLFHPDFQPLSELIMSYNLYY 
HGIKCNSRDGKKLSQAFKGLNISVNYGKNKNLQEDAVMFKIKGFSHSSREQRFVNEGV 
EITVESYFRRKFGIHLRYPELMTVVAEGRTSLLYFPPELMLCSPSQKMASASPFIRHD 
VTERLVEEVGLKSNYFNDFITVGESVEVDGIVLPTPRI FFRDGQETSLNNQSFRNPTD 
FAQTGFFVDAKQQLGGLNYVVNSETWNDSGLLLIGLSTAPYLNSYSSENVTTIGFVSN 
TMDHPQKFAGGYKYVKSGSDVFGQVMPEILLNSLRSARKARKIKPMNIVIYLCGMSES 
RFSIVKEEYVRNCHSVFKTLGEKYSPQLTIIVGSKGHSTRLYARGERDQISNLQPGTI 
VDSVIVSPDYNKFFHCGAVARQGTCKATKYTVLYPESPKMEWIQRMTNDLCYMHEIVF 
HPVSLPAPLYLTAEMAERGTKNLAEKNGPI I FQGIVDFDATNAKYGYRNKGLADTRFN 
A" 



gatcagacac 
gagatcagag 
ttttgaaaat 
attcggacaa 
aaattggagg 
aaagtatgaa 
tattcaatcg 
cattccatgc 
ttccaaaaat 
tgcaagagac 
agcttgttga 
attcagattt 
ttgatttttt 
tcgtgtactc 
atttccagac 
ggaaaaataa 
tttgcgcgcg 
atgttgagca 
ctgaatttca 
tctctcaact 
tcagtcatga 
tcaacgatgg 
aagcgatggg 
ttgatggaca 
tgaaagtttg 
attgttcgat 
ataggacaac 
tctgaattca 
tttttcagtt 
aatcttattt 
atgttgaaat 
cgcaaccctg 
acatagggtt 
tcaatagtaa 
gaattcagaa 
aatttttcag 
cacattttcc 



ttccgagcga 
gcaggatcca 
taatcgatta 
gagaaaatat 
attcgtatca 
aaatacggac 
tctccggtca 
ttcctccacg 
atataattaa 
tgataatttt 
tagtgttacc 
gccagcattt 
tttcgttcat 
tcggacagga 
gtttgaacat 
ttaatcaatt 
aaatttaaat 
atgagaaaat 
ggtcgataag 
tcacaaatgt 
caagattatt 
agctgcgtgg 
acaggttgcg 
tggtgaagcg 
gtttggagca 
accacctgtg 
cggcgatgtt 
gaattatcac 
ttggctgatt 
tggagggaat 
taacgatcga 
cggcacaatt 
gcgaaattac 
agtcgaattt 
cttttttgtt 
gttccaattt 
tacaaatcat 



caacaacttg 
aatatttatg 
tttcaggatt 
ttcgatttaa 
caaagaatca 
tgcaagtatc 
cagacagtca 
aaatggggag 
tttcattttt 
gcagttcgtg 
ggaatcgcac 
tctctccaaa 
ctaatttgaa 
actatttttc 
ttgttctaaa 
gtccgagaga 
cagataagcg 
ttaaagacaa 
cagcaagtaa 
gcattccaaa 
caacatcaag 
acaatcatct 
aacccgattt 
tcgcgtgtcg 
accccggttg 
agtcaggttc 
gaggtaatat 
aaaagaaaaa 
ttcctgtttg 
tcaaacatcc 
tccgtgaatt 
ttttcctctt 
gccccgcctc 
ttcattctta 
ggtatttcga 
cattggtacg 
tggaacgaca 



atttttcaac 
tgagagtttt 
attgtgccgc 
atgctcaatt 
aagtaatctc 
tttgtattgc 
gtacaagata 
cctttactat 
tctaaatttc 
atggatttgt 
tgccaagatt 
aatgcctcga 
ttccgcgcgc 
gtgtcttatt 
acaaaaatca 
agtacacggc 
gcaaagaaat 
atgaattaac. 
ttcttgatgc 
tgattgacaa 
ccacagctat 
cgactgataa 
ctccatgtcc 
agttgcatgg 
aaacgacgtt 
gaaatgagca 
attggatttg 
cgaaaaaaat 
gattcgctcc 
agtcacaatg 
ttgcactgaa 
tttctgtccg 
tctctctttg 
ttcgattttt 
agtcaaattc 
agatgatggt 
tggaccgtgc 



gggaaaagtt 
gaaaatttta 
gaagactctg 
cttctatgga 
aaaaccatcg 
cagtggaacc 
ccttcatgtt 
tcatctttgt 
agttgacgac 
ttactatgga 
gagaatcaga 
aatttttttt 
aaacgagatc 
ggaaattgaa 
tgcaaaaatc 
cgagaatcac 
taaattttgg 
ttgattataa 
atcgtgtagt 
tgagctcgtc 
caatgagcat 
ggctgaatat 
agttgtcggt 
aagagatttt 
cagaagtgaa 
aactcactgg 
gaacacaatg 
cgagctattt 
agaatgagaa 
aaaaatttca 
aaagaggcga 
aaaaaggaga 
cgaccctgat 
ctttgttttg 
ccaattgttt 
gtcgtctact 
agaattgtgt 



cgacatcctg 
atagtttttt 
tatattagtg 
tgtggaatgg 
aaaaagaagc 
aaagttgctc 
gagggaaatg 
aagtacttaa 
gaacgtggtt 
agtgtcgtta 
aaggtaggga 
tttttttttt 
gattggtttc 
attttgaagc 
cgaaaaatcc 
atgaaactca 
aatttttgtg 
actaattaaa 
gaagaaccag 
tatctttgtc 
cgtcatcaga 
cgattcttcg 
tcacttgaag 
aagccaaatc 
gagagtcttc 
atgtttacga 
agaatagagt 
taaaatgtga 
ctgctaggcc 
tttattaatt 
agcatttttc 
aaaaaccgtg 
tgggttaatt 
tgaactatct 
tctaatctcc 
cgtcgggtct 
ccaattacta 
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2221 gattccactc tcgcgggatt actgtatctt 
2281 ttaaatttcc accccgggtc ctgaagcctt 
2341 gtaataatca cttttaactt acttcatttt 
2401 ttttccttct tcgggtggct tttccatcat 
2461 ttgccgtttc tctgtgtctc ccccttctac 
2521 atttttgaga aaatcctgtt gaaaatgttt 
2581 tctaattata atcaacgcgc cctcatatta 
2641 acattcacac ggtggtatct cccttttttt 
2701 ttttgtctcc aacattttcc ttactgaaca 
2761 aacacatgtt tattcagtta ttccgatgtt 
2821 cacgcatccg attcgacatg atgtatctcg 
2881 tgtacatttg acgccattgt ccgccatatg 
2941 ctaaacactt tgtaatttta aatttcatcg 
3001 ggtatttgaa aacttgggga aacaattaat 
3061 tagaaacgat tttggattca aacggtgaga 
3121 taagtaaagt ttatataatt tatgctaaaa 
3181 ttcaaaaaac tttgagttca tcaaaacata 
3241 gaagcagtaa tcaaacaaaa tcatcgagcc 
3301 aaacagtaaa aatattagcc aaatgtaatt 
3361 aaaaaatcga gaatcataga aatcgatttc 
3421 aaaggaaaat atgaaaaaaa tctgaaaaat 
34 81 gagctgtgcg gggtcggtgc atcacgccgg 
3541 ttttttctgt cgacatgtca cttttgaaaa 
3601 tactttctta aataattgta aatgagattg 
3661 ccaccaaacg gtttaattat aaatggttaa 
3721 aattctcaag ttttagaaaa attctgaagt 
3781 ttttagttta aaaaattatt aaaaaaacag 
3841 agccgaattt tgattgaaat catgaggatt 
3901 ttcgtcagct aaataacttt aaaaaatatt 
3961 taacaataac accatatcca aagctataag 
4021 cccatttgaa acagtaaatt attttgtcaa 
4081 gtcgagaatg gatttaaaga cttctgaaac 
4141 ataacgaaaa atgttttttt taatacaagg 
4201 gcggggtcaa aaattaatgg aatatttgga 
4261 tactcatata aactaaaaaa tggtgtttta 
4321 ctactgtgca atattaagcc ccacatcaat 
4381 taaattttat ttcaaactta cagttggcga 
4441 gatgagaaca acaagcaaga ggacttggaa 
4501 aaattaaaaa gagctgtgac gaagttcgca 
4561 attacgcaga tattgggtta aaagggataa 
4621 agaaaaaaat tatggatttt tgaagttctt 
4681 ttagaccgga aacaggaaac aatttgagtt 
4741 ctttttgtaa tatatttgcg actggatttt 
4801 aatttgctca attaatatta cacacttggc 
4861 aaatacccca aaaaaattga ctttacagaa 
4921 aaaaacatga gattttcagg aaatccagag 
4981 tgggcgctat aattctttca aaaactagga 
5041 caatattttg ctattcgatt attttgctat 
5101 aaaaaagaga cctttcatag aactcaactt 
5161 atagaattat accaaagaag ttgttttttt 
5221 tattcttcaa cactgattgt gtaagcaata 
5281 tctgaaaaag tgaataaaga aaattacaag 
5341 aataatggtt tctgaaaata actgaattta 
5401 tataaaaatt acattactgg agaaacaaaa 
5461 tgttttcatt ttcagaaatt tcaagaaaaa 
5521 tatttaacag ttgaaaatgg agtagctcca 
5581 gggctcacag ggctttaaac aaaaaaggct 
5641 aaaaagccca aactttcgga tgtttaaaaa 
5701 tggttttcaa ttagattcac tatttacaca 
5761 acacacgaat caattgtgtc aaatcattct 



tatattgtct cctaatttct cccaattcgt 
catttctaca tttttttctc gcttccatat 
cttgttgcaa tactttttgt ttattttctt 
tttgttcttg gatttattat ttgcaatact 
attcatctct cagcactctc taattacagt 
cggattgaat tatactggat ttttctctgt 
attaagtata tagatacata gatataatta 
tcggataatc tcttcaagtt tttccttttc 
taaaaatttt gaaattaagc cttttttatc 
ctctcttttt cagataaatt atcaactatc 
gattttgtgg aacttccaga ttctcgtcgg 
tttggctctg aagaaaaatt tctatatttt 
attgtttaga agattgagaa tggaagaatt 
atatttttga aacaggaata aattttccag 
attttaaaag aactagttaa gaaattccta 
ttgaagtaat tgtttacaga aatttttttc 
gagcctcaac actaattatc tcgctaattt 
tccaatcggc tttaatacat ctcccaattg 
tgtaattcct catttccttc taaagtttct 
aaagatttct gaaatagaaa tttaattttt 
acattaattt tccattgggc ttccatgtga 
cctcacgaaa tttggaaaat ttggaattac 
aattcttctg ggactcttat tcattcatcc 
aaataccaaa aaccggtttc aaaacccttc 
taagaaacag taaattattt tggaaaatga 
aatccttaat ttctgaaacg aaaatctaat 
cagaaaacat gttatccaag atgtggactg 
tctgataagt ttggcatatt ttctagagct 
aatgggtgtt attcaagtaa tgccgcaaaa 
cccccacacc gatcggactt ttactaatgt 
atttaatttg tagttcgtta gtttcaaaaa 
agaaattatc tttctagtga atagtgttaa 
aattttgaag taaaacaccc aaaatattat 
ttttactttt tctgtacaca tcaccgcgaa 
cagagttatt ctctaattaa cagttaagtt 
cagctttact atatctcaca cttgaaacag 
gttgaaaaat ggaagagcca cggtggcaga 
gaaacgcata gctctctgaa aatctaatga 
ggcgcagtgg gagtataaaa ataaaatgca 
aaaagagacg gataacaata ataataatga 
ttagaaaaca tgagtaattg agttttgata 
tagagtttaa aattcacatg ttcttctgaa 
gaggcttcaa gttgttcata aaagtataat 
agcgcaaaac tctgttaaaa caatatttca 
ttccagtttt ggcaaattgc cgagatttcg 
caatttttgc aaatttttaa cagtttccca 
aaattaaaat ttttattttt caaaaaatca 
tcgttttgca caattttcgt tcaaatattc 
cgaaaagttt tgtaaatttt atattttttc 
tttggttttc caataattgt actttcttat 
gtctcaatac aatgagaaat ttatgaatag 
aattctcacg attactctaa agaatgtgca 
tttatacttt catttttata ccctaacaac 
aatttgccgg aattagtttg aaatacattt 
taattcttac atgtagattt ggagtttttg 
gttgagaaat tgttgataaa ccattttttt 
ttgaacatgc ttgaatgctc taaatttgtt 
attaagattt taaaaaatta tgactttcga 
ctaattgtga gctctaatag agaagatgaa 
aggagtttta gcaataatct gaaatagtat 
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5821 aaattttaaa aaattaaaaa ttctcactca 
5881 tgaataaagt caaactacaa acatacttgt 
5941 attggaatat tctatatgct catttttgca 
6001 attattttca ttacgaaaat gtccttttac 
6061 tttgacagca atatgggaat ctgagattga 
6121 aatttgcaga atttactaga aaaaaagtta 
6181 tactaataat tctgtagtga aacagtaaat 
6241 aacgcgttat ttagctcggg tcacaaaaaa 
6301 gaataactct gaatcgcccc aagtttatcc 
6361 tttatatttc tcggaaacaa attgaattcc 
6421 aaaacctata aatatgtcaa accctaaaaa 
6481 aattattttc aagccaactt ggtctgaaca 
6541 aaataaaaat gatttactaa acaagaattt 
6601 tttttttacg gtttcatgag ttctattagt 
6661 gcagctttta gaacaattat ttttaaatga 
6721 cgatgccaaa agaaaaatga aaattttcct 
6781 tgttgaaact gtgcccacaa ctgcacgaaa 
6841 aaaatattta cagtttcaag tcttctgtta 
6901 cagcacttat gtataaaagt ttattttcaa 
6961 aatcagactt tttcatgaga gtgtgacatt 
7021 aatgcgtctg gtatttcaca gtttttctac 
7081 atagtgaact ttttgcagaa aacccaacaa 
7141 aaaattcgtt atttgatatt taaaagtaaa 
7201 tctactacaa aaaaagaaca aaattttgca 
7261 ataaaaaaaa cctactaata aggatgacac 
7321 tcataaaaac tgactcacat atctaagagt 
7381 agcgtggtgg cttttatagt tagagaaatg 
7441 gaaagtgacg ttagtgtttc aaaagttcga 
7501 aaggcctgaa cttttcagac gaaatttctt 
7561 tagtacgact tgaaaataat caaatcatta 
7621 atttagaaac tttttagtta aattttgtaa 
7681 ttacagctca aaaaaattgc gttgaaaatt 
7741 tacagtttca agtcttccat tagtatgatc 
7801 gagaccattt aatgaacaca atttgaaaat 
7861 tggggaatgc ggcaaagcca aattagccta 
7921 ctttcaaata attgtttcgc gccgaaaata 
7981 aaatgagaaa aagggggccc cttttaagaa 
8041 gtttcatcaa aaaagtattt aaaaaattga 
8101 ttacagtact ctttaaaggc gaacatcttt 
8161 caccgggtac cgtataaaaa gcaaaaagtc 
8221 aatgagttct gctccttttt atactgtgtg 
8281 cctcctcaaa cttaaaaata aaagaaaaac 
8341 tcctctccat actctcgcgt tttgttcctt 
8401 ttgcaaaaaa agattgaatg gacgagaaaa 
8461 ggcagaaacc ttttgaactc ttctcttctc 
8521 taattaacca cctcgtgagt ttccaagtgt 
8581 agaatattct gtagttttcc cacgaagggt 
8641 ttctgagaat gcgtattgcg catcacattt 
8701 aatatcgctg tcacaattcg aaaataaatt 
8761 tttaaaaaaa ttactgtagt tttcgctaca 
8821 atacgcattc tcagaattta gtgtgcccgt 
8881 cataatttgg ggagcatctt caacaatttg 
8941 gctcctcgca aaaaaaaaac gaatcaagaa 
9001 gatcagagat ataagaagct tataaataat 
9061 ttgaatctta atgtgtcgcc cttctctttt 
9121 cggaatcttg agtctgaaat ggaaaacact 
9181 acttcgaaat tgctgtggaa agaattattt 
9241 cagaactgct ccgaattcat tgctcgcttg 
9301 tatttgaatt ttttggcaaa tactgtttta 
9361 ttactacgtt ttatctttta ctaatgattt 



aataatctag agcgtattca aagcttttac 
ttacattttc caagccattt tctaaaagta 
tgatcatcta cattcaattt catctggtat 
cattcattca tttaatgaga aaaaggccaa 
cattttaaat tagaaacctg caaatgacag 
acgaattcaa taaaagtttc aagcataact 
ttttgaatca acattttaat aaatttatta 
ttctgtggct aaatatttct agaaaaataa 
ccaagagcaa gtatccttta attctaagta 
aaaagtctac aacatgttgt tttgacaata 
tacgtgtcat ttattctgtt aataatcaaa 
atgttttcca tatttttagt gtatttcatc 
tgtgatatat ttggaatttt ggcgtttcat 
atgacttgaa aataatcaaa ttattcactc 
agaagaaaaa acaaaaaaag caaacgtttc 
taaatttccc atttattttg actttttgac 
attgcgttaa taattgaaaa attcaaataa 
gtaggatttg acaattcttt atctattact 
atgcatgaat gaagaaagaa aacaaaaaca 
tcagttggac atctgaaaat cgtagcaaac 
agaaaatatc agaggtcaaa taaaatactg 
gggatatagg cttggcagat gcaaaaatgc 
ttatttaatg tatcatctca atattaaaga 
aaatatttcc cttcacttct acgtcgtcac 
atgatgcttc aaatgatatg caccaaatag 
ctaaaaagtc tagacaaata tcttaagcca 
gggcggagtc gaggggaaga ctaaagctgt 
tggcctggta gtaaatgatt ttggactgat 
taaaaatatt tacggtgtca tgactttcat 
caacagaatt atgtgaggag ttatgtattt 
cttgtattga agattatttt aggattatgc 
tgaaatatta tagaaatata ccaaaatatt 
tgaaaattgt cctttattgt atttttatta 
accgatatca . ttttacagta tatttgagtc 
ccccagaata ttttccaatt caaaaaaaaa 
cgatacactg gtctcggcat gttttttgct 
cactgtaatg ttaaacaaaa aataaattgg 
tgataatttc gaagcaacaa aagttaaaaa 
ttacatttaa caaaaaagtt gtcgcttaaa 
gcatctggga attcaaaaag tacctagcta 
ca.aaaagtcg tgtaggcgta acctattcca 
aatggaagga accgtatttc cgcatgaatt 
atttaacatt tttttgtttt tgaattacct 
aataaatatt cgaagagaaa agttgtgggc 
atcaacacac aaaagaattc aaagaatcag 
ttggagtgtg gtgatttctg gagaaaattg 
gaaaacaaaa aatattacgg gaacactaaa 
gacgctcaac gaaaaattgg aggaagttta 
tgtttcgaac tttccgcccg taaatcgaca 
agacattttg cgcgtaagat atgttgctca 
aatagaatgt ttaatcctgt ttaaacgttt 
gcttccccga ttcggctgcg cccactcatt 
ttaaaatcct gggaacccca tttgattgat 
ttaatgcgag tgcaagaatt cagacgattt 
cagatacgtc atgacgtgat ttatctgaaa 
gagattttta aggacggttt taaaaaataa 
ttcaacaaat tggcacattt tttccaaatt 
gattgaacac caaggatgcc ctttttctgt 
ctacttgatc ttgtactaca attggctctt 
tttactacga cttgaaaatt aatttttata 
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9421 
9481 
9541 
9601 
9661 
9721 
9781 
9841 
9901 
9961 
10021 
10081 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 
10621 
10681 
10741 
10801 
10861 
10921 
10981 
11041 
11101 
11161 
11221 
11281 
11341 
11401 
11461 
11521 
11581 
11641 
11701 
11761 
11821 
11881 
11941 
12001 
12061 
12121 
12181 
12241 
12301 
12361 
12421 
12481 
12541 
12601 
12661 
12721 
12781 
12841 
12901 
12961 



gtattataat 
ttgtactgtt 
cactatgaaa 
agtacacaaa 
cagaaccaaa 
cacaaaaaga 
aacttttaca 
tcttcctatg 
acaatcatat 
ctttcggaac 
ggcggcggac 
gctgccttgt 
ctggtctcac 
gtttgctttg 
ttggaataaa 
atttcagaaa 
gaagaagtta 
tactggctcg 
accagcacag 
agttttttcg 
tttaaaccta 
aaaaacgtat 
attggatgta 
taatattatt 
tcgagatccg 
tttccacatc 
acgacgatat 
acgatttgtt 
cctccgctag 
tgtacgtcga 
tctgccttgg 
gttctcattc 
gcgtgtatcc 
tggctggatc 
aaaattttag 
tcaatcagtc 
gttcctcaca 
aaaactatct 
atgattacga 
cgtaactttc 
caagttgaac 
gagctagatg 
acattctgtt 
gattgcaatg 
cttcaaaatt 
atgatggaag 
aaggatgaag 
tcgatgaagg 
gaaaagctta 
atgagaaatg 
cgatttgatc 
tatcaaactc 
actgtaatgt 
tatcaaagaa 
ccaacaggaa 
gaggtaggaa 
tgcatcagtt 
gaaacattag 
gtaccgtccc 
tccgtctttt 



tctagaagta 
gggctactgg 
tcagcagaac 
gttgacattg 
accaagttaa 
atataaagtt 
gatgataatt 
tattatgtgt 
ccgacctgtg 
ccattacagt 
aaaacgcgag 
gtgagagacg 
tgcgcaccct 
acttgttttt 
accgtttttg 
atgtcgtcgc 
tcgattgaag 
actccaagtc 
ttcagacgcg 
gaaaatgatg 
ttgaaaggtc 
acaatgacgg 
attttcaata 
ttcttccaga 
tgccacattg 
acgcgaaatc 
ggtttgctcg 
ggaagacgcc 
ttccagagta 
cggagccaaa 
agaagaacgt 
cgtattcact 
aacaatggac 
tgagaggttt 
agcaaagcgc 
cctcatgact 
aaaccttgag 
ggaaggaaat 
tgaaaacatg 
agagtgctct 
gaatgccatc 
gatctgttcg 
tgacagtaag 
acaaagatac 
catcagaaga 
cagactacca 
aaatcaagaa 
aacgaattgc 
tggaacaaaa 
ttcgtggaat 
aagagaatgt 
caggacgtgc 
ctggatcttc 
gatccaagtc 
cggttcttca 
aaaacattaa 
gaacaaatcc 
cacgaatata 
gttgtcgttt 
cttctcacaa 



aacaaatgta 
cactgtggta 
ttattgtgta 
acactgacca 
tctgggggaa 
ctattaaaaa 
aatttccttg 
ttatttttat 
ctttgatgcg 
ctcaactccc 
ctagcgagtg 
cagacatagg 
ccccctccgc 
gttgattttt 
taaatcaata 
gtaaacgagg 
aaaccgacag 
ttattgcaat 
aaaacgctcc 
gacaagctac 
agaattcgtt 
gaaaacccac 
gtattaataa 
gttgagaaat 
gatgctcact 
actgatcgtt 
gttttcgtaa 
agaaatgggt 
ttgacgaagc 
gatgttgaag 
cgtcgtgtat 
atcaaattgg 
tcatcgcaaa 
gtttttttgt 
acacagaatg 
cttcgtcagt 
caagttccat 
ggaaaaatca 
gttcgttcag 
agctttcgct 
cgagcggatt 
tatggaggat 
aatttcacaa 
ggtagattcc 
gggaccatct 
gcgagtagag 
gcttcgaggt 
ctcgtgtgag 
aatggaggac 
tattgataat 
ggctcatcca 
tccagtcttc 
aagtggagga 
tgcatctcgt 
gtctcgtaca 
taaattccgc 
ggagaatacc 
gtgaaggtaa 
tcccgtgtcc 
agcacacacg 



gaatattcta 
tatcatgttt 
catttggctc 
atttttaaaa 
atacaaaacc 
ttagcaaaac 
tttgtaactt 
gtctttaaat 
gccacatcag 
cgcactgaag 
ccaccgcaat 
tattaaaggc 
cgccagtttg 
cttcgaaatt 
taacaaaact 
aataactcca 
tatcgaagtc 
tgatgaggga 
acaagttgag 
tgtttttgag 
gttattcact 
tgaaaccggc 
tcggtaagct 
gcatcttcta 
tgaaacgcca 
actgtgaagc 
cctacgtcga 
aagaaaaatc 
gtgaaattcg 
tctcgtcaag 
cgtccactct 
ttatggctcc 
ttatcgtttc 
tgtaaactga 
ttggtgaacg 
gtattgaagt 
atcgccagtc 
gaatggttat 
aattcaaaga 
gaagaatcac 
cctcattcat 
gatggaagta 
attgaatggt 
atgtataagt 
tcaactcttc 
attgcacgtc 
ttctgctcaa 
caaggagagc 
aggaagatta 
ccatccccgt 
acagctccaa 
aaaaaacggt 
agtggtcaac 
ctattggatc 
ccggctaacg 
aattttattg 
gtctcacgca 
attccttggt 
ttatcaccca 
taacgcattc 



gtgaacttca 

tttttttcta 

ttaaaaattg 

atagacaaaa 

aaactacgaa 

gaatcttata 

attcgtatgt 

gatttttatt 

gaactttgag 

gaaggcccat 

attacaagag 

gcatagattt 

aattctaacc 

tgttattttt 

tgttttattt 

tcgcgagacc 

gtttgtcgtc 

tctattcaaa 

aaagtgttta 

cggacatctg 

tatggagtta. 

acaggactac 

tttgaaacga 

tccatcagca 

tcaaatggct 

tataaagctg 

aatctacaac 

gagcaataat 

tcatgatcgt 

cgaggaagct 

tctcaacaaa 

gagagcctac 

gcagttatgt 

aatatatatg 

tcttgcggaa 

actccgtcgt 

aaaattgact 

ttgtgtgaat 

atttactaat 

aaacaattga 

tcttcaccca 

gagaaattcc 

tttgaacaaa 

atgctcggaa 

ttactatgat 

tcaaagattc 

gatataagcg 

aagagaatgc 

ttcagtcaca 

ctgttgcttc 

tccaaactcc 

tagaagccac 

agggttacgt 

atcagccact 

ccatacggac 

taattcaatt 

tcaagaagtt 

ttccacccaa 

tctttcctga 

taacaggcac 



catgttttct 
gaaatttatt 
atagagatag 
actacaaaac 
gacacttttt 
ttggcacctc 
tttttattaa 
atatctcact 
gattttcatc 
agcacacggt 
taccactctc 
atctgcgtgg 
gttctcacat 
tcacggtttt 
aattcaattc 
aagtccgccg 
tttgtccata 
ctgttcttcc 
gatttggacg 
ttgatttaat 
ctggatctgg 
tgccgcgtac 
aaaaagcaaa 
ctgaatacat 
gcagaccgtc 
tcaggctaca 
aattattgct 
tgatagtgac 
cagcaacaga 
ctcgaagtgt 
gattcatccc 
gagacgaaaa 
ttggtagatt 
tttaatgaga 
gccaactcga 
aaccaaaaga 
catttattca 
ccaaagcctg 
tatttaaatt 
agtgaaaaag 
atggaattct 
gtgcccaccg 
ttgtttccag 
actttcatct 
tcgccaatac 
tctaaacgac 
tgagaacgct 
tctggttatg 
gaagaaggcg 
tcttcgatca 
accaccgcca 
tacatcgacc 
taatccaaaa 
tcatcgagtt 
tactaaacct 
gttttccaga 
gacgatgaag 
aaacacgcct 
tacaaaactg 
catctaaacc 
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13021 accagaactt caactggaag actcgcaaag cagttcgtat tcaccgtgta cggatgtaaa 

13081 atcgataacg atatactcta atcatgatgt tgaagaagaa gatgaatgag aattatttga 

13141 aatgaatgtt tgctggggga atgagaagat gaagctgctc tttttacatt tttatgtgtt 

13201 tgttgtatgt ttttgattgt ttattggcta ttttagatac tgttttagta gtgcttgaga 

13261 aaactgcctg aatttgcttg atatatactt tttgagttaa aatttcatgc ttaaaattaa 

13321 ctaatttagt gctagttcta tatgcattta atttcagttt ttcatgtaat gttttaggtc 

13381 atataacttt aaaaatacat ttatttttgt gttccacttt ctcatctgtt gctcatctcg 

13441 aatttttcag ggaaatgtaa ttccaacagt ttccggtggc actgccgtgt tcttcaacga 

13501 tatcgagaga ctcactcacg aatctccgag cacacgaaag taattataag tgccgtcttt 

13561 gatcttttta cctccatcct gctgcatttt catgtcaaat ttgtaccata ctcttcaaat 

13621 tttctccaaa tatttattaa tttattatgt ttctcattca taatcccatt ttccgaactc 

13681 gaagagaact cttgtttttt tgtagacctc accgcatttt tatgatcttt cacttcttcg 

13741 tctttttctc cttctacgca caattttttg atatcgatct gaagaactct gttcattttt 

13801 attcccgacg atgatttgtg tactagtaaa tgcttttgta ttttaatttt aattaaactc 

13861 aaagaatata aaatatgaaa ataagttagc ttttattttc gttcaaaaaa atgaatctca 

13921 ataaataaga cccgatttag gtcgagggtt acatagaaaa tggtgggaaa aaggcacggt 

13981 tttatacaaa taattatgat ttaatacgcg ttttttgatt ttttggttgg tatttcaaaa 

14041 aataattcaa tcgaaaaata aaaagggatt atcatgaaaa agaggtcaga aatattgatt 

14101 atcagagaga tttaattttt taaaattaaa gttaaattgg gaaaagccag gcgagagaca 

14161 agatcagaaa ccacgtggtg agagaaatat gagaaaaaaa aaagaaaatc ggtgaaaatt 

14221 aattgcctag agagtatttt taagatgagt gaatacattt actgcattca tcatgaacat 

14281 attatccggt gttgctgtat ttgtcaagaa attggaaaca tttgcgatga tctgaaatat 

14341 acattcgttg aaacagtgga aattttgaaa tattttccgc aagtctggca tgaaatttgc 

14401 aagcgtgccg caccgttttt ttcaaaaatt gctattaact ttcttctctc cagtttaatt 

144 61 tctttgcaaa agtttgcaag acaataattt gtcggactat tttaaagatt ttccagaagt 

14521 ttggtcggtt aaaaaaattc tgggaacgtt ttaaatatat tattgagacg cagaaaacat 

14581 ccatttttga aatcgacgtg taaacgcgct caaccgctaa tccccccaaa cgatgggtct 

14641 cgttggctat tttcccgtaa acccagaata acgaactttt taaaacaaat gaaaacttta 

14701 aaattgacta aaaatacggt tttatcgcca aatatgtacc taacgagacc aatagttcgg 

14761 gggcggagct tttacaactg gccatggagc gcgttttctt ctcgatttca aaaaaatctt 

14821 ctgatgtcaa acctttttct atgtacatac gttctcataa tatacatatt ttaaagtttt 

14881 ttgtttcaaa aaagtcgcaa gattttttaa ccaaacaaaa tcgatttttt taaaatcagg 

14941 taattcaaaa aactttgcaa aaaccaacct gtttagtcat ttgcttataa ttctcaaact 

15001 cctcatcctt cttcaccatt tgctgctcca aatcatgaac tctctgccaa ctattcgtac 

15061 agtctttaca cgaagacgtt gaagattccg aagcagttgg ttgaggtgaa gctccagaag 

15121 aactcgacgg gctcactgag ctttgttcga gacatgatga ctcgagaagc tgaagtggtg 

15181 atgtcttttg aggacttgaa gcacatgatg tatcagtggc tagatcagag tccgatccaa 

15241 gaggagttgg atcattttga tggctgattg ttgtaggttg atgagtatga tggatatggg 

15301 tggaatcaat cgataaattt ggattttgat tcgcaatatg gtgcatgtac atgtgaagtg 

15361 taaaggatgg ttgactattg aattgagatt tgcagatcat acatgttaag ctgagtggaa 

15421 ttgttggagc tgaaaaatgg gaagtttctt tatttttata aaaggtttcc tgtacctgga 

15481 atctcgaacc cagcatcctt atgcatcaca cttccattca aaatattact ttcattatta 

15541 ttattattat tcagaatatt ctcgtgctgc ttaaccaaaa gtgtgttcaa tgcgttccta 

15601 tggttctctt gctgctgaat taataatttc tgagccattt ttgcaatagt ttctgatgag 

15661 agaatcattg attcctgagg tggtttgatg gaaattgcag attccaatcc agctttcata 

15721 gtgggagtga gtagagaccg aattggtgat gatctggaaa atataattat agaatgagta 

15781 aaacagtagg aaattggggt agctacgtag gcacaaaaag cgtgaaagcg cgtaggcagg 

15841 caggcatgaa gcacgcgcta cttacggttt tcccttgcac ttgagatgat gctcgtccaa 

15901 catgacctgc ttcaagaatg atcttccaca ttgttgacac gagaatcctc gttcttgatg 

15961 aatcttcatg tgccgattga gatcaaacta aaatgatcta tcgtgtttta aagttttcta 

16021 gcagattctt gcaaaagtag ccttgcaaaa gtaaaaatta aatgtttgaa cgtaccttaa 

16081 aaataaatgt tttgtgacac tgtggacatt cgtgtgggcg tccaccggaa tgaatcatca 

16141 aatgttgagt gagatgacct ttctgaaaac aatagagtaa aagttaggag tagttaaggt 

16201 aaatagattg aaaatagcaa atatctaaat ttagaagttt gcagaaaggt acagtaactt 

16261 cctttaaatg gtagtgaata ctcttaagtt gaaatttgga gtattgtaaa tacttcatga 

16321 aactactatt taatcatctc tttaaataat ttcatgaaat caggcattat tgtcaattat 

16381 tgtcgcttat gcaataaaac atagaatatt gttttttttt cactaaaatc tgaagcacct 

16441 ataaaaccac tgtttcacca tttcttcaag ttgcaataga tatgtgactg aaacaatttc 

16501 aatacaattt ttttacgcgt attttcggtt ttccgtttgt aattcgatca aaacgtagaa 

16561 tcacaaaaaa accctaaaaa aacccgataa aatttggtga aacagtaaag tctcaacaaa 
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16621 
16681 
16741 
16801 
16861 
16921 
16981 
17041 
17101 
17161 
17221 
17281 
17341 
17401 
17461 
17521 
17581 
17641 
17701 
17761 
17821 
17881 
17941 
18001 
18061 
18121 
18181. 
18241 
18301 
18361 
18421 
18481 
18541 
18601 
18661 
18721 
18781 
18841 
18901 
18961 
19021 
19081 
19141 
19201 
19261 
19321 
19381 
19441 
19501 
19561 
19621 
19681 
19741 
19801 
19861 
19921 
19981 
20041 
20101 
20161 



tgggcccggc 
acatgaaaat 
ctttgaaaat 
cttaaacgca 
tcctaattgg 
tccatcctta 
gtatgccaca 
gtacactgaa 
gcggttgttc 
tctcatgaca 
gtgtcgttgg 
cgatcttgat 
gtttttctat 
aatcaatttg 
aactttaaaa 
gacactaaat 
cattggaacc 
ttcaaattat 
taaaaaatgt 
acatgggaaa 
gattagatag 
cgaaagtctt 
aattactgtt 
cttaggcaat 
atatagacta 
gaatgtttca 
ttatcttcac 
cggaaatgaa 
gatggtgagg 
gtgtctgttg 
aatatgcaaa 
tgaccgggag 
ttgccgcagt 
aggtcatgag 
taaagcatga 
agatattgtt 
aaatgaatga 
cctacctgta 
gagatatcaa 
gtttctcctt 
gtttgaaata 
cacatttaga 
atatacagta 
gtgaaacttt 
cctcatcact 
gctcctccct 
gacacaaata 
caaaaaaaca 
aacatcagta 
gcagttaatg 
ctaaccaagt 
cggtgtgctg 
gtacttgtga 
ttctcttttc 
ggaaacttag 
agcaaacgag 
aagaaagagg 
agtcaaagca 
cgacgttacc 
aaagattgaa 



ttaatttttt 
tttacagata 
tgaaaatttt 
agcagtattg 
acaacttttt 
cctgaaaaaa 
tccaatgctt 
atgagtgcat 
tccactgtga 
atcttcacac 
attcatggat 
tttgatcatc 
aataagaagg 
aaaaaattaa 
gatttgaaca 
tataataaaa 
aatactatta 
gatcagagcg 
taccaaggtc 
aattgaaaaa 
aatatgcaaa 
tagaacaatt 
tcattacgtt 
taataaaaac 
catccccact 
ctatgttctt 
taattttttg 
taattgtaga 
gagacatatc 
gtgccaggaa 
tggctgcgag 
atgtgtgtgt 
gatggaatgg 
tgggcggagc 
aatgagaatc 
ataaggaaaa 
aaaataagaa 
ggtaagcagg 
acaagtgagg 
aaactttttt 
aaacctttac 
tagctgaaaa 
gttacagtat 
atattaaaga 
gtcatcctcc 
ctcgtgtctc 
cagtgagctc 
taaacgcaaa 
taaaaatgaa 
attggtggat 
agtcggggtc 
ctaccgatag 
gatgtttgga 
atgacgaaga 
agaaaaaacg 
ccagtagttg 
cttgactttc 
cagaacggag 
aaaatttgag 
aaatcattca 



ttgttgcaga 
tattcaatat 
tcatgaaact 
tacgcacatt 
cttttgaatt 
tgccttatta 
tttcaataat 
tatttaaatt 
attctcatat 
tcgtaacgcc 
agttgtgatg 
gcgagctgaa 
ttttgagccc 
ttaaaaaatt 
attaacattt 
aatatcgtac 
gaagatagac 
ctagtagata 
atacaaggat 
aaattaggat 
ctaatatttt 
tgaaaagtgg 
tttgtaagtt 
tttcacttaa 
aacaactttg 
gtaagttgct 
cagccaagca 
atcagaaata 
gcttatgagc 
agatcagagg 
ggagatgtgt 
gcgtggcacc 
caatgggtgt 
aggaaaatag 
actgcatgac 
gatagatttg 
aaatctgttt 
cattttttct 
ttgtggaaat 
ctcgataagt 
taaaattcat 
atcgtacatg 
acagtaattt 
aaattggatt 
atccgatcct 
tgtagttcgt 
atgatatgtc 
gaaagttgaa 
aaaagaacaa 
gtgcttgatt 
attgtgtgct 
aagaatcaat 
tgaccccact 
tagagaagat 
cttctttttt 
ggaagaagaa 
atgatgatga 
tcagaaatga 
actacttcta 
tatgaagcta 



attgaaaaaa 
attgaggcaa 
actgtttcaa 
gggtaaactt 
ttggagaagc 
caatgcggac 
tgccgagttc 
aattcccgaa 
gtgtggccag 
ctttctcacc 
aatctgtgtc 
aacaattaaa 
aagttaatgt 
aattttcgaa 
tcagattgat 
caacatgctg 
tattgaaact 
ttcaacaaac 
gcgttgacag 
cagaaatgcg 
acattattct 
gcttaattga 
gttaacttta 
aaatttttga 
aagtttaaaa 
aacttaggag 
agaaaataac 
cggatactaa 
atcacttcag 
tccgatggaa 
aagaacgcct 
aacacgaacc 
aaccatgaga 
taattatatt 
attttttggc 
aaacgaggtt 
ttcaagtaac 
ccacacctaa 
tataattgtt 
tgctctcaat 
agtgtcaata 
atgacataca 
aacggtcgtt 
tctttttcaa 
cttcctgatc 
ccaaactgtg 
tgcaatggaa 
ggattgctca 
gtgtcagggg 
gctttcacat 
cactcactta 
gatggaggta 
gccaccacca 
acagtaatct 
gtttgcgtga 
aagttaaatg 
tgaacaaatg 
aataaaaacc 
aactttaagg 
aactggcaaa 



aatcatcaaa 
aaaaaactaa 
catcttgtta 
gtacaaagtt 
aaataacatt 
aaacatggga 
caaactcttt 
aagtataaaa 
ctctcttttt 
agaagatgac 
cgataaaaaa 
tttagtgggg 
taaaatactg 
aaaaatttct 
aaaatatttc 
caataagttt 
tcatttacta 
atgaaaaagt 
gtggtctcgc 
tgcagtgaaa 
tgaaacagaa 
aacttgaatg 
taacatatca 
aaaaaaccta 
ctcaaaatag 
aaacccacta 
tgttaatgaa 
acaaaaaagt 
gattgctccg 
aaatgaatgg 
gggctcttcc 
gatcaaggca 
agatgttgat 
ggataaggat 
aacaattcta 
gataaattgt 
gggctagatg 
accatgccta 
tcatcatgtt 
cgtattctta 
atttactgtt 
gttagagtat 
tttaaagtcg 
ccagtctcca 
cccttgctcc 
aaatgcaccc 
caggactatt 
ggaaattgac 
tctggaggac 
gtactagaga 
ttcagatatt 
aagatgcggg 
cacccattcg 
aatccgcatg 
ctatttgtat 
gttttgaaaa 
aagtgaaaaa 
tttacagtac 
tagagtacgg 
aaaacaaaac 



aatttagggt 
attcaaactt 
aatatttttt 
atttactcta 
ttcttgaaaa 
ccgttctccg 
tccacattgc 
ctcacactat 
acagcaaaca 
ttttctgaag 
tcggaatctt 
taatggtgaa 
aaaaataatt 
cggaagtggc 
tcaaaatttt 
ctataaaaca 
ttttttcaaa 
gctcatttaa 
tcgcgcaacg 
attgaacaca 
aaaattttga 
aaattctaga 
ctgacaatgt 
taacaagtaa 
ttctagaaat 
aaacgctcta 
aaaagaaaac 
gaagacgaat 
ttcaactctg 
aggggggaac 
tcgatgcttc 
cgcccacttt 
cagaaaaagt 
aacatgttat 
gtctgtttta. 
gcgaattttg 
gcttgttacg 
gcctatacgt 
atttatatga 
tatttccagt 
tcaaattttc 
cagttacatg 
tttttaaatg 
gcaaaactaa 
accgtctcat 
gaacagtcac 
ttcaaattta 
attgctgtcg 
caccaccacg 
aaaaaggctc 
tgttagtaaa 
aaacctagag 
ggcttcttct 
ttgaaaaggt 
taggattgaa 
gaaattggta 
gaaagcaaag 
tcagttcctc 
ccattggggg 
tttttgaaaa 
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20221 atcttaattt aggttaaaaa agtggtgaaa 
20281 gaatatttca gcataattga aacaaactca 
20341 aataccttca cacacgcaag aattatcgaa 
20401 tcagcatttt gtttgaaaac atttgtttaa 
20461 aattgaaaaa aacggacaac aatgcacttg 
20521 tggctttatg aaaatattca atgaaatggt 
20581 tttcaaataa tttccaattt acaaattttg 
20641 aaattattaa aaattgttga actcttactg 
20701 gaggcgtttt ccgggcaact aggggcgatt 
20761 gttaaatacc taatttgcaa aacatgtaac 
20821 ttcactcctc attaatattg acaatgtgag 
20881 tttcaaaact taaaaaccta atcgaaaata 
20941 cgaaaaacag ctccaaatag tgagtcactt 
21001 agtagatgag atgtgtttgg ttttgattca 
21061 tatgttagag tttttgagtt aaacctgaaa 
21121 tttgtgtttg aacatgagca tgtttcgaaa 
21181 aattcccgtt taaaaagttg taaataaagc 
21241 aatgctgctt aaaggaaaaa attgaaacag 
21301 cttttcaatt. tttaataaat tattcgtctc 
21361 cgttcccgag agttcattct gtgccgtcca 
21421 tccccctcg't cttctctctg atatctcaca 
21481 tttctgttct gaagatctcc cccagttgaa 
21541 ggaaacagag agatctgata gcaaaaagtg 
21601 aatttaacag gaaggattgc atgttactta 
21661 cggagaaacc taaggaaaaa caaaaaaagc 
21721 cagttgttgc gcagtagaca catgctatct 
21781 gcaataaagg ggcggggttg atggtacagg 
21841 tctgaataac tatgtactaa aatagttgta 
21901 cattttgaaa ccataaattt atgtagaaac 
21961 taagttgcaa atttaatcga aaaacgaatt 
22021 tgtacaacgt aaagaaaaaa tgaagaaaaa 
22081 accgcaattt tttttcataa ttgtaaagtt 
22141 tttgcgaatt tctcatgaga tacacagtag 
22201 atgttgaaac aggataataa atttttgaaa 
22261 ttattagatt ttgaaaccaa aaaaattatg 
22321 agattttcaa agcacgatca atcgaaaaac 
22381 aagtgaaaac gaatttgcca aagaaagtgt 
22441 aattcttcac cagtagtaga aaaactgctc 
22501 ggaaaataat ttttaaaaac cataaactac 
22561 ttcatgactc tcagacttca caaaaaatca 
22621 ttggagtcca tcttgatctt gatctctctc 
22681 cttccttctg ggttctcttt ctctctcgac 
22741 tgaagaagaa ggcgccgccc gtttttgaaa 
22801 gacaggagcc cgcgccgctg gctcccgcaa 
22861 atacacacac tctttttcat actgaaaagc 
22921 gaaggagcat tgtgaatggg aattgtgaga 
22981 tgggttaggg agttatagtt atgaaatact 
23041 atatttaatg tatcaaaaaa tctacaaagt 
23101 aattgaattt ttaaaagttt ataaaatttc 
23161 atgcattcat cgttattctc aagaaactca 
23221 attgctcttt cagagttttt gtaaactttg 
23281 cactagtacc taaatttaaa taaaccttag 
23341 ttactataat ataagcataa tataaaacgg 
23401 tgaaactgtc acgtttttta gctttaagta 
23461 ctgcatcaga aaagctataa tattattacg 
23521 attttttaat ggtattttta tggtaacaaa 
23581 cattccaaga attatgaaaa atatgcattc 
23641 gctcactttc agttcttagg cagctggtat 
23701 gtcactagga aaaatacgat agcatacatt 
23761 cgcatttaaa ttcaaaaaaa ttaaattttg 



aaatcgccac ttttcaaaaa atctatatat 
gccaatttaa attctgggcg aatttttgaa 
aaatgttatc ttacacaaaa tatttagata 
aatgcattct tgtaattttg ttccttctta 
gataaaccca tacgactgac tgaaactatt 
gaaacagtag atttatgaaa tatttaaaag 
agcccgaagg tgaacaatga ccactcatag 
tcggagtata agtatttgaa aataatttca 
taagaaaagg taggcgtaag tcgcgcacca 
atgtaacatg taacatggat taatcaattt 
acagaacaaa attgttgtca cacaataacg 
ttttgaaagt ttttagttga catgaaactc 
cggtggcact tgacacaaaa tatgaagtca 
cccaacgata cgcaggcagt tgattaaagt 
cagtaatttt tggtgttttt ggagttcaca 
gaaaaacggc aatattaaca aattttttcg 
aggcaaaaaa ggtagtagtt ttattttttc 
taattttggt atttacattt aaagaatttc 
catgcttatt ccacatcacc ctggaacatc 
gtgccaccca tctggtattc agtggtggtc 
cacactcata aactacacga acaacttttt 
aaaaaaaacg aaaaaaagat gctggtggtc 
caggtggggg aaaaacgaca gggcaataaa 
gcaaaatgtg cagcaaaaat tattgtaact 
tgctcgagac ctggatgacc tgtatgtaac 
tccacaggcg gagctaacaa ctgctttggg 
actcctagat cagtgggaag gagaataact 
ttgttttgat tgttttggtc ctagatatcc 
tttgaaaccg taattttttt tctatgaatc 
tttaattacg cagagaagtc aaaattgtag 
ttgtagaaaa ttaaaagtag aaattttgaa 
aattgaattg aaatgcaaca ttgcaaagat 
agtaactgct ctagtgattc tcaaatggag 
ccatgaacta ctacagttgt ttgtgatgta 
tagaaatttt gaaaccgtat tttttttcta 
aaagatattt taaattctct atgaacatca 
ttaatatttt ccaatttata gtaattttca 
tgatgatttt caaatggaga ttttgaaaca 
ggtagctaca agaaaacaat ataatgttag 
aacaagcgtc tctagactcg cgcgctcaag 
tttggcgtgt cgtgaaaaag cgctgctctt 
gtgatatggt acttttgttg tgggaggaaa 
cgacgatgcg cgactcccaa ttcatcacgc 
gcaatacggt taccaattgg tcacaaacac 
tcttttttca cgcttgtgct ctgtcataca 
aggagaagaa tcacgcacgc agaaatctac 
agaaattcgc attttctagt tatttagaaa 
tattaccaac gttttaaatc catcattgat 
tgtttcaacc aaatcattat ttgcgtgcta 
attaaaattc caattttgtt aaaacattaa 
caaaaatgtg tgataatcat atgaaaagtt 
aaacgaacag atcaacgaat atttgaatgt 
tggaaatatg tcatgtgaga tgtgttattt 
taagcatttt tggcctctca gaaattttta 
aatgtttgaa atccaattta gaaaaatggt 
atttaactgt ttcacacaga ttatagataa 
ggtcctcttt cactgttgaa tgatctgctg 
tttgtttttt aattttttcc ggtaggctag 
atctatatta catcttgttt gatatttgat 
gttttttatc aaaacggtga attgctgttt 
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23821 cactaatctg ataagttttg taaaaactct 
23881 tcttgcaaaa acatgtgtcg atttcaaagt 
23941 ataaagaaat gagtgaaggc atttacaaaa 
24001 agaaaaaact cccaaggttt tgaaggtttt 
24061 actccataaa ttcttcaagt tcttttgcca 
24121 ccacaaacat tttgtttatt ttttgcaatc 
24181 tcattcttcc atcaccacca cgtcattcca 
24241 aaaaagcggg ctcccgttgg aaaagtcaaa 
24301 gaaaacttac caactatttt ttgggtgaaa 
24361 aacagcgccc gaaagcatcc aacggtatga 
24421 cggaagctaa aggagaaggc taacgacggt 
24481 gctggttaga ttgactcgac tgtgtgtttt 
24541 ttttcgggta aatatagaga ggtagtggag 
24601 gtgagcatat ttggtgcaaa agtcaagggg 
24661 ttgataaaga atacatagtt taattggtca 
24721 tgagcaacgc ttggaaattt tgaaatccgt 
24781 taaatataat atgatacaca acgcatttta 
24841 aaaccgaaaa aaatatattg ccaactttcc 
24901 gcccacttgc aaaacaacta acattaacgt 
24961 gcttaacttt tctaaactga gaattcataa 
25021 aaacatttaa ataacaagtg aaatagaaaa 
25081 ctgtttcacc taatctttaa gttctcatgc 
25141 tcttttctgc tcaaaaatat ttctgtcgct 
25201 aatggccgct gaatgatcgg catccgtcat 
25261 ctgcgagctg aacttttata aacaaaacat 
25321 aaaaaaaatt agtgaaagaa acttgcaacg 
25381 ttttaagttc tctttcaaat ttggactact 
25441 aacttctgca aatgaccata tccaataagc 
25501 tgatcggcgc ccttctcact ccgctggaag 
25561 aaataaattt ttaaaaaatt aatacaaatt 
25621 ctctctgtta caccaccaaa acattttttc 
25681 cacaaaagcc aagtttcatg atttttctat 
25741 aacctctaga actctattct ctcattcgaa 
25801 aattagaaaa aaaaaacaaa atttaaaata 
25861 gctgtttatt tataatcatt tttttaagca 
25921 agaggatata aacaaaaaag tataggaatg 
25981 accaagagga tacaggatgg tgtctggcgg 
26041 attggagaca ccaataaaaa tagagaaagg 
26101 ggttgccgtt tttaaattta acataatttg 
26161 ttaataaata aattggcaat tttgattatt 
26221 aatcttgctt ctcattgaca aaactaggaa 
26281 gtacattttt ttcactttgg caattggtgc 
26341 gtagcaccag tggggaaaaa gttaagacta 
26401 aaagttttat ttaaatgttt ttttgtaccc 
264 61 caatactttc agaagaaaat cataaaatga 
26521 aaaattaata tttcgcagaa aatataaact 
26581 acgtttaata ttcagatttt atggattact 
26641 accgactgac gctattcaac gttcagggca 
26701 tcttgaatat tcaaattctc aaattaattc 
26761 cttttttcca actttcaaca actcatagaa 
26821 tctttttttt gtatcaccgc taaacaaaca 
26881 aatacagtct ctgaatccta atccctatcc 
26941 ggtgaagagg gtccgattga cccaaccgct 
27001 aaccgccgtc tactttttga tacttctttg 
27061 taaaagaagt attgatttta atgtttgcat 
27121 ttacttgaat aaaatgatgg gaaatttcaa 
27181 agatattata tgaaaattgc aaaataatgt 
27241 attttattgg ttaaaattca aaaactttaa 
27301 tacttgttaa atgtttaaaa attcgaactt 
27361 tttaacctta aaagataatt ttttacagtt 



ttatttcaat gaagaagtca taattatata 
gcttcaaaat tcataatctt acgtattttc 
accatcaatt gccggtactg caaaataacg 
tcagtaggcg tctttctctc ttcattattg 
cctccacaca tgcgaactaa agaaacacta 
ctttttccat ttactctctg cttcccttct 
atatgagctc tctttactca acaccacgca 
cggtttttgt tcggttttcg gtttcgcgtc 
gtggagaaga agaagagaaa gtcaaaccca 
gcaaactccg atggagctca agcacagaaa 
gtctctatct ctaagggggt taggttgact 
tgtatgtgtc tctcgtcttt tctttccatt 
atcaatagga aaagaatgag ccagtagttt 
ttcgtaagag gatgaagcag taggttatgc 
agcacgcgag gaggggtttt ttttaacagg 
agacgtgcaa acttgttaaa ttatgcgaac 
ataaatagga aaacattaga aaaacaagtg 
acaaaaatta ctgtttcata taagttttaa 
ttacgtagtt aataaaatgt ttgccttcga 
ttaataatga acgctctgtg atagatcgga 
cactgtattg ccaactttca acgaaaatta 
aaaactgaac ttttgctatt gtatattctt 
gaccaatgtg cggtttgcaa ggggcgtggc 
aatgggtgcg aagttcaaat aatttcttgc 
tcataattaa tattgaacgc taataaacag 
tttatacaaa aatgtactgt ttcatatagt 
actaattggt attaattatc agggtttatg 
gaatggtggg ggtgaacgaa ggatgctgat 
ttcaaaaaat ttgcgtctga ctttcagaaa 
atttttcata agttcccaaa aaccctgtat 
acaaaaagcg aaaaataata ttttgcacga 
ttaaatatta aaacggtcat gaaattcaaa 
acaaagaata tatggaacct atgaacaaaa 
aaacagattc ctttgcactt atgcaatcaa 
aaacgttcga gtagagagta ataagagaag 
gaaagtggtt tactcaaatg agcacccatc 
ggagcaatcc gattacgcca tgtgtcttcc 
tggatctggt gacaagtaat aagtgtgaag 
agtggaaaaa tcgaaacgtg tttcatgtac 
ttcgatgcat gagtggtaac tacgttttga 
tttcctagca ggatgtcatc tcttcaagat 
taggcagata ttggtcgacc tgaagttgga 
cacgtgctga aatctctaaa aatcatgttt 
aaaaagtgca gattgctcag ttttcgccac 
attttttaac taatagttca tatttcgccg 
ttttgaaaaa ttggcaaaat ctctaacgga 
attacaaaaa agttttcaat aaaaaaaacc 
ggatttttgt gtgccctcat gatgtagcct 
cgaatttccc gagctttaac gaaaccctca 
ttttttccat ttcttcaagt ctacacatca 
gttctcatct cattctctgt gttgcacatt 
atttacggtg ccatgtttgt cacttgacaa 
tattacgtaa ttattatatt gtatcagtag 
atactatgaa agagttttag gcagcataac 
gatttaacaa ttcaatgaaa tagtcaaaat 
agaggattat tacaaaaaat tctgtttcac 
tttgaaaatt ggtaattttt ttaattacga 
cacattttca aacctaagta tttctggtga 
tataattaca aaattaaatg aaaagtggat 
tcatgaatta tgtaatactt tgatcctttt 
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27421 
27481 
27541 
27601 
27661 
27721 
27781 
27841 
27901 
27961 
28021 
28081 
28141 
28201 
28261 
28321 
28381 
28441 
28501 
28561 
28621 
28681 
28741 
28801 
28861 
28921 
28981 
29041 
29101 
29161 
29221 
29281 
29341 
29401 
29461 
29521 
29581 
29641 
29701 
29761 
29821 
29881 
29941 
30001 
30061 
30121 
30181 
30241 
30301 
30361 
30421 
30481 
30541 
30601 
30661 
30721 
30781 
30841 
30901 
30961 



ttcatttttg 
ttgaacaaat 
ttatataaag 
cttttttcaa 
caataaatta 
aatactgttt 
aaaccaaaag 
tttttgcgaa 
ctcgcggagc 
atccctttac 
tgtgtgtgct 
ccacattaac 
aatctttgaa 
atgatttttt 
aatagtgaat 
ttgtatatga 
gcaacaaaaa 
aattgtctaa 
ggtcggccta 
ttaaatgaaa 
catttgatat 
attgatgtct 
cgtgaaacag 
aaacagtgaa 
taatttgtga 
ttatattatg. 
tttaaatgtt 
acttttatga 
tctcattcaa 
aataattttt 
ttctttaaca 
ttctcacgta 
ttttcgttgg 
caaacatact 
gcatccgctt 
ggaacaacag 
aaaaatcaga 
tgaattcaaa 
ctcgtgaata 
ccatgcgtag 
acttgttttg 
gctcaagtac 
tcgccttcta 
gaaataagtt 
tgagatcaaa 
ataaaaatgc 
ttaaaatctg 
atgcgaaaaa 
tttcatggca 
tatttaaatt 
ctccgcctat 
cactgtttat 
gtcttgctgt 
caaaatttta 
acctgttgtt 
attttcaccc 
atattgggat 
atcttgtttc 
ttgtcccctg 
gtaataattg 



tttacactaa 
ctaaaatttt 
atttccaaaa 
attttttatt 
agatttaaaa 
cgttgactaa 
agcaagaacc 
ataccgcgcg 
atgtgtcact 
ataccaccgg 
tgcaagtgac 
aggtggaaaa 
aaatcatgga 
ttagttttag 
catataactt 
tcaattagaa 
tgaaaatggt 
aacgctttag 
aaaatcgcaa 
aaaaaacata 
ttttcgaaaa 
ctaggtagag 
tgattattca 
ttttttgaaa 
ataacgtaag 
aaaactggtt 
gttttttaat 
tcacaaattt 
agtttgagaa 
acgttttcaa 
acagaaaaag 
cattcaaaaa 
aacagtaaat 
tgtaaaaact 
ggaacacctg 
aacacccaaa 
aaaacttctt 
acacggcggc 
ggtcaacttg 
ggtctctttc 
caggaaaatt 
ttgtcacata 
ggcacataat 
gtataataaa 
cgctgaggca 
agatttcaca 
aaagtattaa 
ccaacaaata 
cgtttgttaa 
ttaatgcgaa 
aaatatcttt 
gttcaattat 
aacacaagta 
ctgtttcaga 
caatcgacat 
acatcttcag 
aaatactgtt 
atgcatggac 
ggatacggag 
actttttggt 



cagcaggcca 
acggcgttac 
agttagattc 
ttgagaattt 
ctattttttg 
cacctattca 
atgttaacct 
gcatccgcca 
ctgaaggaaa 
ttgtaatcta 
cacttgatgg 
aaatggaggt 
aaatcacaca 
tattgtgaga 
aaaaacaatt 
aaagcaaaaa 
ttctccaaaa 
aatatttgaa 
aatttgaaaa 
tatctgcaaa 
aaaaatgaca 
tggaattatg 
aaacgtacaa 
caaattaatc 
agacctttaa 
gcatgctcat 
aaaggcgcaa 
ttgcagaaaa 
agtttataat 
gaaatatttt 
tttttgagca 
gtgttggaat 
aatttttggt 
gaaacatttg 
acgttatctc 
tgtctcagaa 
cttcctcttc 
ttctgataaa 
atcggtggct 
cgttggacca 
ttaaaaatga 
ggctctttgg 
gcttatgcct 
tgtgtcattc 
aataggtagg 
tgtaatttaa 
aaattgtaag 
tgcgggccaa 
ctttaatttt 
taaaattaag 
gatcaatcgt 
aaatggaaat 
tacatagtga 
aatgaagttt 
tgtttaatcg 
cttttttatt 
gtattgtatg 
tcttttggca 
gaacttttga 
tgttgcaaaa 



aaatagcaaa 
ataacttcaa 
tattgaccac 
gtttaatgat 
ctgtttcgta 
tgaattctga 
catccatctc 
ccagccgcgc 
tgatccttct 
tatctcgcgt 
tccagcggct 
gaatgggaga 
attatttgaa 
tgaatgattt 
ttcttaatca 
tctactgaaa 
taagtctcta 
acatgtgtag 
ataaaattaa 
acacgactga 
attttcgagg 
atacctaggt 
attcagatat 
aggttttaaa 
aaaagaaaac 
ttccagagta 
acagctcagt 
aattaatatt 
attacaatat 
atactttcaa 
aggttagttg 
cgaattcaaa 
tttgaaaatt 
ttttaaaaac 
tttgctcgat 
atgagaaaaa 
cactagagaa 
aagaagatag 
ccccgagccg 
actgtttttg 
aacgagatca 
agttttttca 
tcaagacgcc 
gtctatctga 
cgtgtagaca 
aaatgtatct 
aaaacacttt 
tcgtgaaaag 
caattaaaaa 
aaaaaactca 
acaggttcac 
aaactaatta 
aacagtaatt 
ctgattaaga 
acctgttagt 
taccaattgg 
attggagagg 
aacatacaac 
ttttggaaat 
caatactgtt 



atggcatttc 
gaaattctgt 
cattgcttac 
tcgaaagtta 
attttcaata 
ttaaaaatat 
ctcctccaac 
tttgattggc 
ctgtcttctc 
gcgcagagag 
tcttcttttt 
tgcaagactg 
acattgaatt 
actaattttt 
gtggttttca 
tttaaacagt 
aacaactcaa 
tcaatccgat 
ttaatttttt 
ataaaaactt 
aaattttcac 
gcaaagtaat 
aagacgaaaa 
taaagaaatt 
ataaaatatg 
cttacagtga 
aatgcaaccg 
aataaaaata 
gagattgaga 
tgaaaatact 
gttggaatat 
tttcatggag 
ttgcaatatt 
cataatctct 
ttaatctcta 
gatgaaaggg 
atccaccagc 
tggtcatctc 
gtgccaggag 
cgatgaaacg 
agcagttttg 
gagggagcat 
ctaccttgtg 
agcctacact 
cgcccgaaag 
ttaaatcccc 
aattaatttt 
ttcagaaaaa 
caatttcaaa 
ccaaagttta 
aacgttttcc 
ttttcaaatc 
tttgaaccaa 
atgtgcaaac 
tttaacatta 
attagtgatg 
atttgagagt 
atgatgggac 
ttttttactg 
acccaacgtt 



attaaacttt 
ttcgcagatt 
tattagcaaa 
ttaattatta 
cttttatatt 
atcccacaca 
agttgtcttt 
tccgcaacgg 
tttagttacc 
agagacaatg 
gtgctttgga 
ttgatggaat 
tttgaaaatc 
tttaattgtg 
aaagcagtag 
taatttgtaa 
agtaaggtaa 
ttgacagttt 
tttgttttca 
aattatgaaa 
aaagacgtca 
ttatgaacta 
acattatttg 
gccaaatttt 
aatatagaga 
cagggatttt 
aagtttacac 
taagaaaagt 
gtcttttgag 
gttttgttgc 
agaaaactgt 
gacattaaat 
ttctagacat 
aaaaataagt 
atcaagtaac 
aagcatgaat 
tgatgtcggt 
ctgcgttgtc 
catgaagcca 
cacacacaaa 
aaattgaagt 
gtggagaagg 
cctacaggct 
gtaaaaaaat 
cctatctaaa 
aaaaatgatt 
ttaaatgtta 
cggtttgtaa 
tttgattata 
aaaaaaaaat 
tgtgttttaa 
aacatcacgt 
catttttttt 
acttcaatcg 
atttgaacaa 
gtagatgtac 
gtgtgtcggt 
aatgtgctaa 
tttcaaagcg 
tttaaaaact 
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31021 
31081 
31141 
31201 
31261 
31321 
31381 
31441 
31501 
31561 
31621 
31681 
31741 
31801 
31861 
31921 
31981 
32041 
32101 
32161 
32221 
32281 
32341 
32401 
32461 
32521 
32581 
32641 
32701 
32761 
32821 
32881 
32941 
33001 
33061 
33121 
33181 
33241 
33301 
33361 
33421 
33481 
33541 
33601 
33661 
33721 
33781 
33841 
33901 
33961 
34021 
34081 
34141 
34201 
34261 
34321 
34381 
34441 
34501 
34561 



tattcttatc 
tatatcgtca 
ccaaactttt 
gagagacgtg 
aatgtgatat 
aggtcggtaa 
gactttcaaa 
acaatttctt 
tttgttctgt 
atattttaat 
aaccgagggg 
tgtttttgct 
tattatcaag 
acgaaacaga 
ataaatatct 
agtatttcag 
acaaatgcaa 
cattatacac 
ttacagtaag 
aattttttcg 
aaaaaagaca 
atatcattca 
aatattctta 
tcaatatgtt 
aaacacatta 
cataaggttt 
gacacgcgct 
ttaaaaatgg 
ggttccttcc 
aacagactct 
ttttttggtg 
ctttttgcgt 
gggtatcaat 
acttggtccg 
ctgtttatac 
aagacataaa 
cagacaccca 
aacaatcgag 
aggaaacttg 
gtttcaaaag 
tttcatgttt 
ttttttttgg 
attaatgcct 
ctcctgcgat 
aaacttctgc 
caagcccagc 
gcgggtctta 
agtcatggag 
gcagtatagg 
cttactgtcc 
ggtgtcgcgg 
acccatttat 
agcagcgaaa 
cgagttgacc 
ggggatgctt 
ctctcgaaat 
gatcgtagtt 
agttcttgcc 
ggaactcgta 
ttttgaacag 



atgcaagttt 
taaagtagct 
ccatacttca 
gaagctggct 
ttgagaagct 
aaggaggggg 
aattaaaaga 
aaataattgc 
ttctcgacgc 
atattcccat 
cacggttcgg 
gaaaattttg 
attatttata 
aaaataatgt 
tatcatgtac 
gagtttttta 
actgtaatat 
atagcttatc 
gttgttctta 
aacattttta 
gttgggagtt 
gctattgttt 
taatattttg 
tataccagcc 
gatatacggt 
cccaaaattt 
ttatatatag 
cgtatataaa 
cgtgaagtat 
tattgtttat 
agaatccttg 
tcacttctgg 
ccatccatag 
ttctctgcgt 
cggacatgag 
tgatatgtga 
taggcacccg 
acattgggaa 
gaatttcaaa 
attcacatca 
caatataagc 
aagtagtatt 
atgacttctt 
taggaggcaa 
ccggcaaggt 
gaatacgatg 
aaagttgtat 
cagcaacaac 
ccagcgttga 
cgcccaattg 
tttcctcgac 
cattttcagt 
gagagaggtc 
gaacttttct 
gagagtcgca 
atcctcatcg 
ctttttcgtg 
taacactaca 
cagaaaaacc 
cacccaaaag 



cttagtcgaa 
tctttcttac 
cagtcaatgt 
ggtggttgac 
gagcaagaag 
aggaagtgta 
caatatatgg 
gtggaattta 
ctggcctgta 
ttacgatata 
tattgttttt 
aaaaaaaatg 
tttaattttt 
ttaagaatgt 
tccgtattag 
aaaattaaaa 
cgttttcata 
aggtttttaa 
ttgactgcca 
gaaaaaaata 
tttgttgtta 
tttttcaaac 
gcttttgtcc 
gtggaccgca 
gtctatcgcg 
cgttgctctg 
tgtttatata 
ttaaaataag 
tgtttatttt 
tgaacattaa 
aatgccaaaa 
ccgatcttct 
ttttctgctg 
ctctcactgc 
tttggagtaa 
gagctacttg 
atgaagaata 
taatgtcaaa 
atgttgagca 
aaaaacttct 
tatttattat 
ttttcaaagc 
gccaacacaa 
agaagtcgtc 
gattctggag 
ggtaagacaa 
gcggtcttgt 
gatgtaatcg 
ttgtagcaca 
agagaagctc 
cgggaaccac 
aatgatatct 
acaaagcaat 
atctttccgg 
gaaagttcgc 
atatccaaac 
aacttttcga 
cacatgttac 
cctcggaggg 
ttttttcaaa 



ttaagaaact 
ttgtgccacc 
gttttgttta 
tgagagctca 
tcattgacac 
agttaacaag 
gaaatacatt 
aatttcagtc 
tttctagttt 
tcaaaattat 
aatcaactat 
tgtactgttt 
ttttcattgt 
ttgtaaaaaa 
aaacaacgtt 
tttatgtttt 
agttttactt 
attgcgtctg 
tctttcaact 
agcgagtctt 
tccaaattgc 
gtcacttttc 
aattttatgt 
cctacggcgc 
gacctcgatc 
atcagagatc 
ttctcctttt 
taagtgaaaa 
tcatcttcaa 
ggattgtgtg 
actcgattac 
catacacttt 
ctgcgcggga 
tccacacttt 
cgagatagtg 
ctgcatgtgg 
agaagaatta 
aggggtttct 
tagttcagaa 
ttccgaaaaa 
gttggtgaaa 
gaacaagatt 
cggaagttga 
caaccccgtt 
tgagcggaac 
aatttggatt 
gttgtcatgg 
atgagcatgg 
gtctggaaga 
ttttttccgt 
tcttttttcc 
ttaacccagt 
ttttctgaga 
ttttttggag 
gtgacgttgc 
gagagcgatc 
accaaaactt 
gacgagcttc 
cgtgacattc 
tatattaaac 



tttctatttc 
tctcacctac 
ccgtcacgcc 
acaaagagca 
aaaagaacac 
gggatctaaa 
ttttacaaaa 
gcttacagct 
ttcattgtta 
tctcctggct 
ctggaaaaaa 
actgttttaa 
atttagaata 
ccgttttgaa 
aaaatatatt 
ttagtggaaa 
ccagatatta 
tttcaggttg 
ttttaatagt 
cccagaaatt 
caattgaaat 
aaaacctagt 
taatctgctt 
ggcccccgac 
gatgtctgct 
aaactacata 
tctatatatt 
agcaactgtt 
cttttaaacc 
caatttttat 
tatgttttcg 
catcctgaac 
tgtacacgac 
ctcgccgatg 
tgtgatttga 
tgtgcgcgga 
ctttacggcg 
gaactttttt 
tatgggttaa 
cagtcatatg 
aagtctttgc 
caatgggtaa 
acagtaccct 
tcgacgacct 
agatggtagt 
ccgactgtct 
agcagcaaca 
accacctttt 
agctctctgg 
gaagttctca 
tggtatggct 
ccgttgttcg 
atcggacaat 
atggttgatg 
tcttgaatca 
ttatcttggt 
catggtattg 
gttgcaattg 
ggcgagcaaa 
atgcacatta 



gttcctagaa 
caatgtttga 
attcaaaaaa 
tgtagacaga 
ctacttaaga 
gtaccaagtt 
gaaaaacact 
tccagtcaaa 
cgactgttca 
gtaattcaat 
agttggaaac 
acctttttca 
aagtagcttg 
tacaatttcg 
cttgaaactg 
caatactttc 
tatttaaaac 
agatgctctt 
cattcattga 
tgtttgattt 
tttcatgaaa 
tgtaaaacat 
tctgtcatgt 
ttttttgaga 
gcgcatttag 
gggacagaag 
ggaattattc 
gaacatttcg 
ctagttgcaa 
gaaatctgtt 
agtgatggat 
acatgacggt 
gccagaaact 
ggttacactt 
ggagtacgca 
tcaaactcaa 
tgagatgaaa 
cctgtttgaa 
atgatttaag 
caggaatccg 
aaaattttga 
tatatttacc 
ccgcctagaa 
ttcgatcatg 
cagtaggagc 
ggagcttctg 
actttggagc 
tcaattaaat 
aaatgacacc 
tttgaggtca 
aacatacaat 
tttcctagtg 
ttgtgaggaa 
atgattcttt 
tcttccaggg 
ttatcatcga 
taggtgacag 
cagctttggg 
ttctcggtca 
cgtactatgc 
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34621 
34681 
34741 
34801 
34861 
34921 
34981 
35041 
35101 
35161 
35221 
35281 
35341 
35401 
35461 
35521 
35581 
35641 
35701 
35761 
35821 
35881 
35941 
36001 
36061 
36121 
36181 
36241 
36301 
36361 
36421 
36481 
36541 
36601 
36661 
36721 
36781 
36841 
36901 
36961 
37021 
37081 
37141 
37201 
37261 
37321 
37381 
37441 
37501 
37561 
37621 
37681 
37741 
37801 
37861 
37921 
37981 
38041 
38101 
38161 



aaataaaaaa 
attttgcaaa 
ccaaactaaa 
tacagtttca 
ttattaggga 
ataaatattt 
atattttaat 
ggggaattga 
ggtgtagaga 
aaaaacccag 
aaaatcgaaa 
atactggact 
cgcatttgtc 
aataagtttt 
gattatttca 
tcggccatct 
tcatgcatat 
ggatacaaga 
tggaagaatt 
aggtttgata 
ggcctttggt 
aacttagaca 
taacacaagt 
gtcggaaact 
ttagaaaaaa 
actcgttccc 
aactggggac 
tcttccttga 
ataggcttga 
attacctgac 
acttacgtca 
catggtgttt 
cagtcataaa 
ctcacttcac 
ccagaatcat 
gttttattaa 
aaatgaatct 
tagtcgtaat 
cggaaaacgg 
gcttggcgtc 
tattcaagga 
catcgacttc 
cgacttcttc 
tctgaaaaaa 
atattttttt 
aatcacttta 
attttttcct 
cgaaattttt 
tagatacact 
ggtttgtttt 
gttctgtaac 
agcgtgatgt 
tgacaagatc 
cacaacatca 
ataagttccg 
atttcgactc 
ttaaacatca 
agtcctttga 
taatagaggt 
ctggatttca 



acagaattgg 
gactttttga 
atatttatct 
ctattcgttt 
aatttagttc 
ctgaaaaaat 
tctattgccg 
atattgaata 
aataagctta 
gggagacgca 
tagaagaatt 
tatgcgttaa 
gcatcaaaat 
cactggttca 
gtatttaaga 
cggctgtaag 
aacaaagatc 
ctgtatattt 
tattataatc 
tttggtcctg 
tccagtgcct 
aaatttggac 
tttgcttaat 
tcttggaaac 
tccacatata 
ctctcgcgta 
tatatttttc 
cgatactgaa 
tttttcgagc 
cgaaaacctg 
cttcctgatt 
gagacaaagc 
cgttcaatgt 
tagaatacga 
tccaagtctc 
taatgttgga 
cttgggtaag 
gctttctagt 
catgaacgtc 
aacgaaaaat 
agtttcttgt 
aacggattct 
aacaagtctt 
tataacctgc 
gggtttttgt 
aattttacga 
aaaaagtgca 
atttggttgt 
ctgtgaaaca 
aattgcaaat 
cagatgttgg 
ttttatttta 
ctttccttcg 
attccggtgg 
gatatcttag 
cttcgttcac 
ctgcatcttc 
atgcttgtga 
tatacgacat 
ctgaaaattt 



aaaaaaataa 
ccaacctaat 
gaaaaatcag 
gaagaatata 
tctgatatca 
tactgtttca 
aaatggattt 
ttgagattta 
acaaacaaca 
tcggagattt 
tgcaagatac 
atctagtgtc 
caacgattcc 
aaagtattca 
agctcaccca 
gtatagtggc 
gttagtcatc 
cgttgctttg 
tggagacaca 
aaaaaaatat 
tccccgttag 
aaggttttgg 
ccaaagtctt 
atttgatagc 
cttccaaaaa 
gagtctcgta 
tccgagcgtc 
ccgactttca 
ttttcgggca 
aaatatatgc 
ttacgtattt 
caattgtcgt 
cctttcgtaa 
attgagataa 
actgttgacc 
acaagtttca 
aacatcgtat 
gaccaggaat 
gttattgtga 
cccgtttgag 
ccatctcgga 
ccgacagtga 
tctgttacat 
gtactttttt 
ttggagaaaa 
atattcttta 
aaataaatcg 
acctaagttc 
gttaaaaatt 
tagcaaatct 
atgacggagc 
aaaatttcag 
ttgttaatca 
aaagtagagc 
atgaattccg 
aaatctttgt 
ctgtagattt 
cagcttcttt 
tatcagctcg 
catttgataa 



attaacgctg 
aaataataaa 
aggtcgttaa 
tacgtttcaa 
cgtgactttc 
aaagtaaagc 
aaaggcatca 
tgattgtaga 
tgttggttaa 
acttttcagt 
gaatacattc 
ggcgaggcct 
ttggaagatg 
tataataact 
tttttctccg 
gctggaaggc 
cgctgaatcc 
catgtccctt 
attacagagt 
tgtttggaat 
ccagaaaaaa 
aaagacttga 
gccaaaagga 
ttatgaaaac 
gatctgccta 
ctgtggcctt 
ttgaaaaccg 
ctcatcccac 
cttctgaggg 
acttgtgtaa 
gtagcctcca 
gacattctga 
ttcaaaactt 
ggtgctgtcg 
acataattca 
gctctctggt 
tcctgctcca 
atgatttgct 
ataaaatcaa 
caaaatcggt 
agaaaatacg 
taaaatcgtt 
catggcgaat 
ttattttttg 
actaaaaaaa 
gcaaaatgta 
aaattcaata 
atagcagccc 
tcaactttta 
ttcgaaaaca 
ttggtagatg 
tctaaaaata 
ttctatctgt 
aaagatgttc 
aattttcttc 
tctcttgaag 
ttatttttgc 
ccatctctgg 
gaaagtggtt 
ttaaaattta 



agcaatgaag 
tgtatcctct 
accgctaggc 
cacagtctta 
tgtagaaaca 
ttataattca 
ggtgagaaat 
gggttcagat 
aaaagtcaat 
gaagatcaat 
tgagaaaaaa 
ttatttctgt 
ataggtctga 
acttctaaac 
cgagattttt 
tgactggatg 
attccatttt 
gacgggcaac 
caacaatggt 
gagtaaacta 
acgtggatca 
actttggttg 
gggattcaaa 
agtatgatat 
gttttcccca 
tggatcccac 
aatgacaatt 
acaagtaaat 
aattgagaag 
taaattgaga 
gcaaactttt 
aaattgattt 
tcccaaaaat 
aaagtccaat 
atccaccaag 
acaaatttct 
aacttttgaa 
ttcccgagtt 
tcgaaggaaa 
tgggtttctg 
cggagttgga 
gaagtaattc 
gaatggagat 
gtattttgac 
aaaatcacat 
gtttttaacg 
tattaatttt 
tgatcatgac 
aattgacaaa 
gtgtgctcga 
cccggcgcag 
ttataaaacc 
tcaccttctg 
ttccttcagc 
taaaataaga 
aatgtgaaaa 
catagttgac 
aattgcattt 
ggaaatcagg 
ggccagatgg 



aaacaaacaa 
caatggattt 
ctaccattcg 
ttacaaaata 
catctagagc 
gaacacacaa 
atatttatgt 
gttaaaatca 
tgagataatt 
ttcagagagg 
acgtgttgaa 
acccgtattt 
aacttgctaa 
aagtatttga 
agttccacgc 
aaaaacgatc 
tggtgactcc 
tgcaccacaa 
tcctggctgc 
taaattcggc 
gaaattgggt 
caccaaagat 
agttgaactc 
taaatacttt 
aattgaactc 
aataatcgtc 
tctgacgtac 
aacaatgttc 
aatttcaggc 
tctatgtaag 
gtggatgatc 
tattcggctt 
attaaattca 
caacaacaat 
cctgacatta 
agctgtctcg 
aaagttatga 
tttggctctc 
ttgtactgct 
aaggattggt 
agcacaattc 
gatttcaatc 
gcagatgcca 
gaaaaaaaca 
tccatacttt 
ggttttaatg 
gtttttagcc 
cgatgatgaa 
caatgaaaca 
caattcacga 
atatatgaga 
caaattacct 
cgatggacta 
cacaactgtc 
ctccacagta 
cccctttatc 
agagatgttc 
tattccatga 
atggaatagg 
aatttgtttt 
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38221 gagttaggta aacaattttg aattgcattt 

38281 tctgagaatt aacataatta aattttggca 

38341 attgaaaccg taatctcgtg aaatgttatg 

38401 caatacattt taatcaaatt gtcagaagaa 

38461 gatcctcttg agaatagacc tccacataaa 

38521 tgcttttttc taatgacgtc ttacagtaaa 

38581 aacaaatacc aaactagaac taattaacat 

38641 acgaaaaact gtcctgttgt attctgctga 

38701 agcatgtgag cggtcttttt ttctgaacac 



ttcaattttt tttcaatttc cttgaatgtg 
attaaaagtc agtgtactta tgaaaaatat 
taaatcccaa gtatcaaaac ttctttcatt 
gtcagtagta atttgcaaga aatctgttca 
cgaaaattta ttgcgattgg tgtgacgtca 
ttaacgtcac attcgaaata ttggtttttt 
ttaataaatt gaaactgact atccgttgtg 
ccttctcctt gaactggaca atttgaaaga 
atagttcacc gttacatggt 
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DEFINITION 
ACCESSION 
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KEYWORDS 

SOURCE 
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REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 
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AUTHORS 
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REMARK 
REFERENCE 
AUTHORS 
TITLE 
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COMMENT 
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source 



gene 
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HSMKLP1 3348 bp mRNA linear PRI 19-JAN-2000 

Homo sapiens mRNA for mitotic kinesin-like protein-1 (MKLP-1 gene) . 
X67155 S46300 
X67155.2 ■ Gl:6723674 

ATPase; kinesin-like protein; microtubule-associated protein; 
mitotic protein; MKLP-1 gene. 
Homo sapiens (human) 
Homo sa pie ns 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 3258) 

Nislow,C, Lombillo, V . A. , Kuriyama,R. and Mcintosh, J . R . 
A plus-end-directed motor enzyme that moves antiparallel 
microtubules in vitro localizes to the interzone of mitotic 
spindles 

Nature 359 (6395), 543-547 (1992) 

93024924 

1406973 

2 (bases 1 to 3348) 

Gryka, M. A., Shiekhattar, R. , Nigg,E. and Mcintosh, J. R . 

Unpublished 

3 

Nislow, C.E. 
Direct Submission 

Submitted ( 03- JUL-1992 ) C.E. Nislow, University of Colorado at 

Boulder, Dept MCD Biology, Campus Box 347, Boulder CO 80309, USA 

revised by [5] 

4 (bases 1 to 3348) 

Gryka, M. A. 

Direct Submission 

Submitted ( 13- JAN-2000 ) M . A. Gryka, c/o R.I. Mcintosh, University 
of Colorado at Boulder, Dept MCD Biology, Campus Box 347, Boulder 
CO 80309, USA 

On Jan 20, 2000 this sequence version replaced qi : 34 67 1 . 
Location/Qualif iers 
1. .3348 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/cell_type="HeLa cells" 
1. .3348 

/gene="MKLP-l" 
61. .2631 
/gene="MKLP-l" 
/codon_start=l 

/product="mitotic kinase-like protein-1" 
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/protein id=" CAA47628 . 2 " 
/db_xref="GI : 6723675" 
/db_xref="GOA:Q02241" 
/db_xref="UniProt/Swiss-Prot :Q02241" 

/translation=' f MKSARAKTPRKPTVKKGSQTNLKDPVGVYCRVRPLGFPDQECCI 
EVINNTTVQLHTPEGYRLNRNGDYKETQYSFKQVFGTHTTQKELFDVVANPLVNDLIH 
GKNGLLFTYGVTGSGKTHTMTGSPGEGGLLPRCLDMIFNSIGSFQAKRYVFKSNDRNS 
MDIQCEVDALLERQKREAMPNPKTSSSKRQVDPEFADMITVQEFCKAEEVDEDSVYGV 
FVSYIEIYNNYI YDLLEEVPFDPIKPKPPQSKLLREDKNHNMYVAGCTEVEVKSTEEA 
FEVFWRGQKKRRIANTHLNRESSRSHSVFNIKLVQAPLDADGDNVLQEKEQITISQLS 
LVDLAGSERTNRTRAEGNRLREAGNINQSLMTLRTCMDVLRENQMYGTNKMVPYRDSK 
LTHLFKNYFDGEGKVRMIVCVNPKAEDYEENLQVMRFAEVTQEVEVARPVDKAICGLT 
PGRRYRNQPRGPVGNEPLVTDVVLQSFPPLPSCEILDINDEQTLPRLIEALEKRHNLR 
QMMIDEFNKQSNAFKALLQEFDNAVLSKENHMQGKLNEKEKMISGQKLEIERLEKKNK 
TLEYKIEILEKTTT I YEEDKRNLQQELETQNQKLQRQFS DKRRLEARLQGMVTETTMK 
WEKECERRVAAKQLEMQNKLWVKDEKLKQLKAIVTEPKTEKPERPSRERDREKVTQRS 
VSPSPVPLLFQPDQNAPPIRLRHRRSRSAGDRWVDHKPASNMQTETVMQPHVPHAITV 
SVANEKALAKCEKYMLTHQELASDGEIETKLIKGDIYKTRGGGQSVQFTDIETLKQES 
PNGSRKRRSSTVAPAQPDGAESEWTDVETRCSVAVEMRAGSQLGPGYQHHAQPKRKKP 



ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 



gtactcctca 
atgaagtcag 
aaccttaaag 
gagtgttgca 
agactcaacc 
actcacacca 
attcatggca 
acaatgactg 
aacagtatag 
atggatatac 
aatccaaaga 
gtacaagaat 
tcttatattg 
cccataaaac 
tatgttgcag 
tggagaggcc 
tcccatagcg 
gtcttacagg 
agtgaaagaa 
aatcagtcac 
ggaactaaca 
tttgatgggg 
gaagaaaact 
cctgtagaca 
ggtccagttg 
ccgtcatgcg 
ttagagaaac 
gcttttaaag 
caagggaaac 
ctggaaaaga 
atctatgagg 
cagcgacagt 
acgacaatga 
cagaataaac 
cctaaaactg 
agatctgttt 
attcgtctcc 
gcctctaaca 
tctgttgcaa 



accactctcc 
cgagagctaa 
acccagttgg 
tagaagtgat 
gaaatggaga 
cccagaagga 
aaaatggtct 
gttctccagg 
ggtcatttca 
agtgtgaggt 
cttcttctag 
tctgcaaagc 
aaatatataa 
ccaaacctcc 
gatgtacaga 
agaaaaagag 
tgttcaacat 
aaaaagaaca 
ctaaccggac 
taatgacgct 
agatggttcc 
aaggaaaagt 
tgcaagtcat 
aggcaatatg 
gaaatgaacc 
aaattttgga 
gacataactt 
ctttgttaca 
taaatgaaaa 
aaaacaaaac 
aagataaacg 
tttctgacaa 
agtgggagaa 
tctgggttaa 
agaagccaga 
ctccatcacc 
gacacagacg 
tgcaaactga 
atgaaaaggc 



taatgattgg 
gacaccccgg 
ggtatactgt 
caataataca 
ctataaggag 
actctttgat 
tctttttaca 
ggaaggaggg 
agctaaacga 
tgatgcctta 
caaacgacaa 
agaagaggtt 
taattacata 
acaatctaaa 
agttgaagtg 
acgtattgct 
taaattagtt 
aatcactata 
cagagcagaa 
aagaacatgt 
atatcgagat 
gcggatgatc 
gagatttgcg 
tggtttaacg 
attggttact 
tatcaacgat 
acgacaaatg 
agaatttgac 
ggagaagatg 
tttagaatat 
caatttgcaa 
acgcagatta 
agaatgtgag 
agatgaaaag 
gagaccctct 
tgtgccttta 
atcacgctct 
aacagtcatg 
actagctaag 



aacaaaagaa 
aaacctaccg 
agggtgcgcc 
actgttcagc 
actcagtatt 
gttgtggcta 
tatggtgtga 
ctgcttcctc 
tatgttttca 
ttagaacgtc 
gtagatccag 
gatgaagata 
tatgatctat 
ttgcttcgtg 
aaatctactg 
aatacccatt 
caggctccct 
agtcagttgt 
gggaacagat 
atggatgtcc 
tcaaagttaa 
gtgtgtgtga 
gaagtgactc 
cctgggagga 
gacgtggttt 
gagcagacac 
atgattgatg 
aatgctgttt 
atctcaggac 
aagattgaga 
caggaacttg 
gaagccaggt 
cgtagagtgg 
ctgaaacaac 
cgggagcgag 
ctctttcaac 
gcaggagaca 
cagccacatg 
tgtgagaagt 



aaaaaaagaa 
tgaaaaaagg 
cactgggctt 
ttcatactcc 
catttaaaca 
atcccttggt 
cgggaagtgg 
gttgtttgga 
aatctaatga 
agaaaagaga 
agtttgcaga 
gtgtctatgg 
tggaagaggt 
aagataagaa 
aggaggcttt 
tgaatcgtga 
tggatgcaga 
ccttggtaga 
tacgtgaagc 
taagagagaa 
cccatctgtt 
accccaaggc 
aagaagttga 
gatacagaaa 
tgcagagttt 
ttccaaggct 
agtttaacaa 
taagtaaaga 
agaaattgga 
ttttagagaa 
aaactcagaa 
tgcaaggcat 
cagccaaaca 
tgaaggctat 
atcgagaaaa 
ctgatcagaa 
gatgggtaga 
tccctcatgc 
acatgctgac 



aaaaaaagcc 
gtcccaaacg 
tcctgatcaa 
tgagggctac 
agtatttggc 
caatgacctc 
aaaaactcac 
catgatcttt 
taggaatagt 
agctatgccc 
tatgataact 
tgtatttgtc 
gccgtttgat 
ccataacatg 
tgaagttttc 
gtccagccgt 
tggagacaat 
tcttgctgga 
tggtaatatt 
ccaaatgtat 
caagaactac 
tgaagattat 
agtagcaaga 
ccagcctcga 
tccacctttg 
gattgaagcc 
acaatctaat 
aaaccacatg 
aatagaacga 
aacaactact 
ccagaaactt 
ggtgacagaa 
gctggagatg 
tgttactgaa 
agttactcaa 
cgcaccacca 
tcataagccc 
catcacagta 
ccaccaggaa 
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2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 



// 



ctagcctccg 
ggtggtggac 
ggtagtcgaa 
gaatggaccg 
ctgggacctg 
cccagtactg 
cagtcttcca 
atcatacact 
aaccttgtat 
ttcatcactg 
aactaattca 
tgatttttta 
tgctgctatt 
aaataaagat 
tatgtatata 
attcctgcag 
gcttttgttc 



atggggagat 
aatctgttca 
aacgaagatc 
atgtagaaac 
gatatcagca 
aaagaacatt 
ggtcatcttg 
gacccagagc 
agtgtatgtt 
tatgaatttt 
cacaagtgtt 
ttgtgacttc 
tattagctgc 
gtaaatgact 
ttcactttct 
cccgggggat 
cctttagtga 



tgaaactaaa 
gtttactgat 
ttccacagta 
aaggtgttct 
tcacgcacaa 
ttcatttgtg 
tagaactcca 
aaagctttcc 
ttgccatatt 
ttataatgtt 
tgtcttagat 
tccagccctg 
aagaatgcac 
ggccaataat 
gacatttaga 
ccactagttc 
gggttaattt 



ctaattaagg 
attgagactt 
gcacctgccc 
gtggctgtgg 
cccaagcgca 
tggatgattt 
gctttgttga 
ctatggttca 
taatattaat 
tttttaaaat 
gattaaggaa 
gtctgaattt 
tttagaacta 
aaccatttta 
tatgccaaaa 
tagagcggcc 
cgagcttggc 



gtgatattta 
taaagcaaga 
aaccagatgg 
agatgagagc 
aaaagccatg 
ctcgaaagcc 
aaatcacgga 
aagacaacta 
agcagaggaa 
atatttcatg 
gactatatct 
cttaaggttt 
tttgacaatt 
ggaaggtgtt 
gaattaaaat 
gccaccgcgg 
gtaatcat 



taaaacaagg 
atcaccaaat 
tgcagagtct 
aggatcccag 
aactgacagt 
atgccagaag 
cctcagctac 
gtattcaaca 
gactcctttt 
tatacttata 
agatcatgtc 
tataaacaaa 
cagactttca 
ttgaattctg 
caaaagcgga 
tggagctcca 
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OZOODO 1 




K\r\\i HO OnfM 4-fY7 DM 
NOV l/i ZUU4 \.Kjl rlvl 


Live 






3258581 




Oct 25 2004 4:09 AM 


Dead 




® 


3258581 




Aug 25 2004 4:37 AM 


Dead 






3258581 




Nov 19.2002 1:04 PM 


Dead 




® 


3258581 




Aua30 2002 3:10 PM 


Dead 






3258581 




Aug 28 2002 5:12 AM 


Dead 


© 




3258581 




Mav23 2002 11:02 AM 


Dead 






3258581 




Sep 21 2001 4:59 AM 


Dead 


© 


o 


3258581 




Jul 25 2001 4:56 AM 


Dead 


© 


o 


3258581 




Jun26 1998 8:38 AM , 


Dead 
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© 


1397339 


n/a 


Jun.29 1996 1:00 AM 


Dead 
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2 


Aug 27 2003 1:03 PM 


Live 
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6723674 


2 


Jan 20 2000 10:17 PM 


Dead 






34671 




Mar 9 1999 2:37 AM 


Dead 


<§) 




34671 




May 29 1996 2:35 AM 


Dead 






34671 




May 23 1995 8:46 PM 


Dead 




® 


34671 




Nov 30 1994 8:04 PM 


Dead 


® 




34671 




Sep 3 1993 3:02 PM 


Dead 
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Dead 
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34671 
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Dead 
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